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A method for selecting novel proteins such as growth 
hormone and antibody fragment variants having altered bind- 
ing properties for their respective receptor molecules is pro- 
vided. The method comprises fusing a gene encoding a protein 
of interest to the carboxy terminal domain of the gene III coat 
protein of the filamentous phage Ml 3. The gene fusion is mu- 
tated to form a library of structurally related fusion proteins 
that are expressed in low quantity on the surface of a phagem- 
id particle. Biological selection and screening are employed to 
identify novel ligands useful as drug candidates. Disclosed are 
preferred phagemid expression vectors and selected human 
growth hormone variants. 
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ENRICHMENT METHOD FOR VARiANT PROTEINS WTTH ALTERED BINDING PROPERTIES 

FIELD OFTHE INVENTION 

5 This invention relates to the preparation and systematic selection of novel binding proteins having 

. altered i)inding properties for a target molecule. Specifically, this invention relates to methods for producing 

foreign polypeptides mimicking the binding activity of naturally occumng binding partners, in preferred 
embodiments, the invention is directed to the preparation of therapeutic or diagnostic compounds that mimic 
proteins or nonpeptidyl molecules such a hormones, drugs and other small molecules, particuiarly biologically active 
10 moleajles such as growth homione. 

BACKGROUND OFTHE INVENTION 

Binding partners are substances that specifically bind to one another, usually through noncovalent 
interactions. Examples of binding partners include ligand-receptor, antibody-antigen, drug-target, and enzyme- 
substrate interactions. Binding partners are extremely useful in both therapeutic and diagnostic fields. 

1 5 Binding partners have been produced In the past by a variety of methods including; han^esting them 

from nature (e.g., antibody-antigen, and ligand-receptor pairings) and by adventitious identification (e^. 
traditional drug development employing random screening of candidate molecules). In some instances these two 
approaches have been combined. For example, variants of proteins or polypeptides, such as polypeptide 
fragments, have been made that contain key functional residues that partk:ipate in binding. These polypeptkle 

2 0 fragments. In turn, have been derlvatized by methods akin to fraditional drug development. An example of such 
derivitizatton wouM mclude strategies such as cyclization to conformattonally constrain a polypeptide fragment to 
produce a novel candidate binding partner. 

The problem with prior art methods is that naturally occumng ligands may not have proper 
characteristics for all therapeutic applications. Additionally, polypeptide ligands may not even be available for 

25 some target substances. Furthemiore, methods for making non-naturaliy occurring synthetic binding partners 
are often expensive and difficult, usually requiring complex synthetk; methods to produce each candidate. The 
inability to characterize the structure of the resulting candidate so that rational drug design methods can be 
applied for further optimizatton of candidate molecules further hampers these methods. 

in an attempt to overcome these problems, Geysen (Geysen, immun. Today. 6:364-369 [1965]); and 

30 (Geysen et a/., Mol. Immun. . 23:709-715 [1 986]) has proposed the use of polypeptide synthesis to provMe a 
frameworic for systematic iterative binding partner Idenb'ficatnn and preparation. According to Geysen e/a/., 
Ibid, short polypeptides, such as dlpeptMes, are first screened for the ability to Und to a target molecule. The 
most active dipeptides are then selected for an additional round of testing comprising linking, to the starting 
dipeptlde, an addittonai residue (or by internally modifying the components of the original starting dipeptide) and 

35 then screening this set of candidates for the desired activity. This process Is reiterated until the binding partner 
having the desired properfies is Mentified. 
' The Geysen etal. method suffers from the disadvantage that the chemistry upon which it is based, 

peptide synthesis, produces molecules with ill-defined or variable secondary and tertiary structure. As rounds of 
iterative selection progress, random interactions accelerate among the various substituent groups of the 

40 polypeptide so that a true random population of interactive molecules having reproducible higher order structure 
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becomes less and less aflainaWe. For example, interactions between side chains of amino adds, vrfiich are 
sequentially widely separated but which are spatially neighbors, freely occur. Furthemiore, sequences that do not 
facilitate confbmiationally stable secondary structures provide complex peptide-sidechain interactions which may 
prevent sidechain Intefactfons of a given amino acid with the target molecule. Such complex interactions are 
5 facilitated by the flexibifityof the polyamide backbone of the polypeptide candidates. Additionally, candidates 
may exist In numerous conformations making it difficult to kjentify the confomier that interacts or binds to the 
target with greatest affinity or specificity complicating rational drug design. 

A final problem with the iterative polypeptide method of Geysen is that, at present, there are no 
practical methods with which a great diversity of different peptides can be produced, screened and analyzed. By 

1 0 using the twenty naturally occurring amino acids, the total number of all combinations of hexapepBdes that must 
be synthesized is 64.000,000. Even having prepared such a diversity of pepBdes, there are no methods available 
with whfch matures of such a diversity of peptides can be rapidly screened to select those peptides having a high 
affinity for the target molecule. At present, each "adherenr peptide must be recovered in amounts large enough 
to carry out protein sequencing. 

1 5 Ta overcome mary of the problems inherent in ttie Geysen approach, biologteal selection and screening 

was chosen as an alternative. Biological seiectionsand screens are powerful tools to probe protein function and to 
isolate variant proteins with desirable properties (Shortle, PrgtQin Enginesrinq. Oxender and Fox. eds., A.R. Uss, 
Inc.. NY, pp. 103-108 [1988D and Bowie ef a/., SclfinJS, 247:1306-1310 [1990)). However, agiven selection or 
screen is applicable to only one or a small number of related proteins. 

20 Recentiy, Smith and coworiters (Smith, ScifiDCfi. 228: 1315-1317 [1985]) and Parmley and Smith, QsD&, 

73:305-318 [1985] have demonstrated that small protein fragments (10-50 amino adds) can be "displayed' 
efficiently on the surface of filamentous phage by inserting short gene fragments into gene III of the fd phage 
("fusion phage"). The gene III minor coat protein (present in about 5 copies atone end of the virion) is important 
for proper phage assembly and for infection by attachment to the pili of £ coli (see Rasched et al. , MlcrobiQl, 

25 Rev.. 50: 401427 [1986]). Recently, fusion phage' have been shown to be useful for displaying short muteted 
peptide sequences for identifying peptides that may react with antibodies (Scotte/a/..ScifiDefi-249: 386-390, 
[1990] )and Cwiria pf^f Pmr^ Natl. Acad. U.S.A 87: 6378-6382, [1990]).or a foreign protein (Elevlin ef a/.. 
Science. 249: 404-406 [1990]). 

There are, however, several important I'lmitatidns In using such fusion phage" to identify altered 

3 0 pepBdes or proteins with new or enhanced binding properties. Rrst, it has been shown (Pannley et al,, fifiofi, 73: 
305-318, [1988]) that fusion phage are usey only for displaying proteins of less than 100 and preferably less 
tfian 50 amino add residues, because large inserts presumably disrupt the function of gene III and therefore phage 
assemWy and infectivity' Second, prior art methods have been unable to select peptides firom a library having the 
hghest binding affinity for a terget molecule. For example, after exhaustive panning of a random peptide library 

35 with an anti-p endorphin monoclonal anfibody, Cwiria and co-workers could not separate moderate affinity 
peptides (K^ - 10 ^iM) from higher affinity peptides (Kd -0.4 ^iM) fused to phage. Moreover, the parent 
endorphin peptide sequence which has very high affinity (Kd - 7nM), was not panned from the epitope library. 

Ladner WO 90/02802 discloses a method for selecting novel binding proteins displayed on the outer 
surface of cells and viral particles where It is contemplated that the heterologous proteins may have up to 184 
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amino add residues . The method contempiates isolating and amplifying the displayed proteins to engineer a new 
family of t}inding proteins having desired affinity for a target molecule. More specifically, Ladner discloses a 
"fusion phage' displaying proteins having 'initial protein binding domains' ranging from 46 residues (crambin) to 
164 residues (14 lysozyme) fused to the M13 gene III coat protein. Ladner teaches the use of proteins 'no larger 
5 than necessary" because it is easier to an'ange restriction sites In smaller amino add sequences and prefers the 58 
amino add residue bovine pancreatic trypsin inhibitor (BPTI). Small fusion proteins, such as BPTI, are preferred 
when the target is a protein or macromolecule, while larger fusion proteins, such as T4 lysozyme, are preferred for 
small target molecules such as steroids because such large proteins have defts and grooves Into which small 
molecules can fit. The preferred protein, BPTI, is proposed to be fused to gene III at the site disdosed by Smith 

10 et aL or de la Cruz e/a/., J. Biol. Chem.. 263: 4318-4322 [1988], or to one of the termlnii, along with a second 
synthetic copy of gene III so that 'some' unaltered gene III protein will be present. Ladner does not address the 
problem of successfully panning high affinity peptides from the random peptide library which plagues the 
biological selection and screening methods of the prior art 

Human growth hormone (hGH) partidpates In much of the resMation of normal human growth and 

1 5 development. This 22,000 dalton pituitary homione exhibits a multitude of biological effects induding linear 
growth (somatogenesis), lactation, activation of macrophages, insulin-like and diabetogenic effects among others 
(Chawla, R, K. (1983) Ann. Rev. Med. 34. 519; Edwards, C. K. et aL (1988) Sdence 239. 769; Thomer, M. 0., et al. 
(1988) J. Clin. Invest. fiL 745). Growth hormone deficiency in children leads to dwarfism which has been 
successfully treated for more than a decade by exogenous administration of hGH. hGH is a member of a ^mily of 

20 homologous hormones that indude placental lactogens, prolactins, and other genetic and species variants or growth 
hormone (Nicoll, G. S., e/a/., (1986) Endocrine Reviews 7. 169). hGH is unusual among these in that it exhibits broad 
species spedfidty and binds to either the doned somatogenic (Leung, D. W., etal., [1987] Nature 330. 537) or 
prolactin receptor (Boutin, J. M.,e(ar., [1988] 69). The doned gene for hGH has been expressed In a 
secreted loan in Escherichia a& (Ghana C. N., et af., (1987] lafiDS 189) and its DNA and amino add sequence has 

25 been reported (Goeddel, ef a/., [1979] Nature 281. 544; Gray, etaL, [1985] iSsflg^, 247). The three-dimensional 
structure of hGH is not available. However, the three-dimensional folding pattern for porcine growth hormone 
(pGH) has been reported at moderate resolution and refinement (Abdel-Meguid. S. S., et aL, (1987] Proc. Natl. 
Acad Sci. USA 84. 6434). Human growth hormone's receptor and antibody epitopes have been identified by 
homolog-scanning mutagenesis (Cunningham etaL, Sdence 2S3i 1330, 1989). Tlie structure of novel amino temiinal 

3 0 methionyl bovine gowth hormone containing a spliced-in sequence of human growth homione inducfing histidine 18 
and histidine 21 has been shown (U.S. Patent 4,880,910) 

Human growth hormone (hGH) causes a variety of physiological and metabolic effects in various animal 
models including linear bone growth, lactation, activation of macrophages, insulin-like and diabetogenic effects and 
others (R. K. Chawla etaL, Anna. Rev. Med. 34, 519 (1983); 0. G. P. Isaksson etaL, Amu. Rev, Physiol. 47, 483 

35 (1985); C. K. Edwards etaL, Science 239, 769 (1988); M. 0. Thorner and M. L. Vance, J. Clin. Invest. 82, 745 

(1 988); J. P. Hughes and H. G. Friesen, Ann. Rev PhysioL 47, 469 (1985)). These biological effects derive from 
the interaction between hGH and spedfic cellular receptors.. 

Accordingly, it is an object of this invention to provide a rapid and effective method for the systematic 
preparation of candidate binding substances. 
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It is another object of this invenlion to prepare candidate binding substances dteplayed on surface of a 
phagemid particle that are confotmationaliy stable. 

It is another tit^ of tiis invention to prepare candidate binding substances comprising fusion proteins 
of a phagecoat protein and a heterologous polypeptide where the polypeptide is greater than 100 amino acids in 
5 length and may be more than one subunit and is displayed on a phagemid particle where the polypeptide is encoded 
by the phagemid genome. 

It is a further ot)ject of Ihis invention to provide a method for the preparation and selection of binding 
substances that is suffidentiy versatile to present, or display, ail pepBdyl moieties that could potentially 
participate in a noncovalent binding interaction, and to present these moieties h a fashion that is sterically 
10 confined. 

Stilt another object of the invention is the production of growth honnone variants that erfibit stronger 
affinity for growth honnone receptor and birefing protein. 

It is yet another object of this invention to produce expression vector pha^mids ftat contain a 
suppressive teminalion codon functionally kxated between the heterologous polypeptide and the phage coat 
15 pro|einsuchthatdeteclablefiJSionproteinisproducedtaahostsifl)pressorceBandonVth^ 
polypepfide is produced in a non-suppressor host cell. 

Rnally, it is an object of this invention to produce a phagemid particle that rarely displays more than one 
copy of candidate binding proteins on the outer surface of the phagemid particle so that efficient selection of hi^ 
affinity binding proteins can be achieved. 
20 These and other oljects of this invention will be apparent from consideration of the invention as a whole. 

SUMIWABYOFTHE INVEhfTlON 
These objectives have been achieved by providing a method for selecting novel binding polypeptides 
comprising: (a) constnjcting a repficaWe expression vector comprising a first gene encoding a polypeptide, a 
second gene encoding at least a portion of a natural or wild-type phage coat protein wher^ the first and second 
25 genes are heterologous, and a transcription regulatory element operaWy linked to the firstand second genes, 
thereby fonning a gene iusfen encoding a fusion proiBin; (b) mutating the vector at one or more selected positions 
within the first gene thereby foraiing a family of related piasmids; (c) transfbnning suitable host cells with the 
plasmids; (d) infedir^ the transfomied host ceHs with a helper phage having agene encoding the phage coat 
protein; (e) culturing the transfomied infected host cells under concfitions suiteWe for fonning recomKnanl 
3 0 phagemid particles containnig at least a portion of the plasmid and capable of transfonning the host the 

condifions acQusfed so that no more than a minor amount of phagemid parfides display more than one copy of the 
fusion protein ot the surface of the partide; (0 contacting the phagemid particles with a target molecule so that 
A least a portion of the phagemid particles bind to the target molecule; and (g) separating the phagemid 
particles that bind from those that do not Preferably, the method further comprises transforming suitable host 
35 cells vwth recombinant phagemid partides that bind to the tai^t molecule and repeating steps (d) through (g) 
one or more times. 

Additionally, the method for selecting novel binding proteins where the proteins are composed of more 
than one subunit is achieved by selecting novel binding peptides comprising constmcting a replicable expression 
vector comprising a transcription regulatory element operaWy linked to DNA encoding a protein of interest 
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containing one or more subunits, wherein the DNA encoding at least one of the subunlts is fused to the DNA 
encoding at least a portion of a phage coal protein:mutaling the DNA encoding the protein of irteresl at one or 
more selected positions thereby forming a family of related vectors; transforming suitable host cells with the 
vectors; infecting the transformed host cells with a helper phage having a gene encoding the phage coat protein; 
5 culturing the transformed infected host cells under conditions suitable for forming recombinant phagemid 

particles containing at least a portion of the plasmid and capable of transforming the host, the conditions adjusted 
so that no more than a minor amount of phagemid particles display more than one copy of the fusion protein on the 
surface of the particle; contacting the phagemid particles with a target molectrie so that at least a portion of the 
phagemid particles bind to the target molecule; and separating the phagemid particles liat bind from those lhat 
10 donoL 

Preferably in the method of this invention the plasmid is under tight control of the transcription 
regulatory element, and the culturing conditions are adjusted so that the amount or number of phagemid particles 
displaying more lhan one copy of the fusion protein on the surface of the particle is less than about 1 %. Also 
preferably, amount of phagemid particles displaying more than one copy of the fusion protein is less than 10% the 
1 5 amount of phagemid particles displaying a single copy of the fusion protein. Utost preferably the amount is less 
than 20%. 

Typically, in the method of this invention, the expression vector will further contain a secretory signal 
sequences fused to the DNA encoding each subunit of the polypeptide, and the transcription regulatory element 
will be a promoter system. Prefenfed promoter systems are seleded from; Lac Z, Xpl, TAC, T 7 polymerase, 

20 tryptophan, and allaline phosphatase promoters and combinations thereof. 

Also typically, the first goie will encode a mammalian protein, preferably the protein wn be selected 
from; human growft hormone(hGH), N-melWonyl human growth hormone, bovine growth homrone. parathyroid 
homione, fcyroxine, insulin A-chain, Insulin B^shain, proinsuBn, lelaxin A-chain, relaxin B<hain. prorelaxin. 
glycoprotein hormones such as follicte sSmulaling hormone(i=SH), thyroid stimulating hormonefTSH), and 

25 leutinizing honnone(LH), glycoprotein hormone receptors, calcitonin, glucagon, factor VIII, an antibody, lung 
surfactant, uroWnase, streptokinase, human tissue-type plasminogen activator (t-PA), bombesin, factor IX, 
thrombin, hemopofetic growth factor, tumor necrosis factor-alpha and -beta, enkephalinase, human semm albumin, 
muBerian-inhibiting substance, mouse gonadotropin-associated peptide, a microbial protein, such as 
betalactamase, tissue factor protein, inhibin, activin, vascular endothelial growth factor, receptors for homiones 

30 or growth factors; integrin, thrombopoielin. protein A or D, rtieumatoid factors, nerve growth factors such as 
NGF-p, platelet-growth factor, transforming grovirth factors (TGF) such as TGF-alpha and TGF-beta, insiJin- 
like growth factor-l and -II, insulin-like growth factor binding proteins , CD4, DNase, latency associated peptide, 
erythropoietin, osteoinductive factors, interferons such as interferon-alpha, -beta, and -gamma, cotony 
stimulating factors (CSFs) such as M-CSF. GM-CSF. and 6-CSF. interieuidns (ILs) such as IL-1, IL-2. IL-3, IL- 

35 4, superoxide dismutase; decay acceterating factor, viral antigen, HIV envetope proteins such as GP120, GP140, 
atrial natriuretic peptides A, B or C, immunogtobulins, and fragments of any of the above-listed proteins. 

Preferably the first gene will encode a polypeptide of one or more subunits containing more than about 
100 amino acid residues and virill be folded to form a plurality of ri^d secondary structures displaying a plurality 
of amino acids capabte of Interacting with the target. Preferably the first gene will be mutated at codons 
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corresponding to only the amino acids capable of Interacting with the target so that the integrity of the rigid 
secondary structures will be preserved. 

Normally, the method of this invention will employ a helper phage selected from; M13K07, M13R408, 
M13-VCS, and Phi X 174. The preferred helper phage is Mt3K07, and the prefenred coat protein is the M13 
5 Phage gene III coat protein. The prefen-ed host is E. coli, and protease deficient strains of E colL Novel hQH 
variants selected by the method of the present invention have been detected. Phagemid expression vectors were 
constructed that contain a suppressible temnination codon functionally located between the nucleic acids encoding 
the poiypept'de and the phage coat protein. 

BHEF DESCRIPTION OFTHE RGURES 
1 0 FIGURE 1. Strategy for displaying large proteins on the surface of filamentous phage and enriching for 

altered receptor binding properties. A plasmid, phGH-M13gtll was constructed that fuses the entire coding 
sequence of hGH to the carboxyl tenninal domain of M13 gene lit. Transcription of the ftjslbn protein is under 
control of the lac promoter/operator sequence, and seoBtion is directed by the stil signal sequence. Phagemid 
particles are produced by infection with the 'helper* phage, M13K07, and particles displaying hGH can be 
1 5 enriched by binding to an affinity matrix containing the hGH receptor. The wild-type gene III (derived from the 
M13KQ7 phage) is diagramed by 4-5 copies of the multiple arrows on the tip of the phage, and the fusion protein 
(derived from the phagemid, phGH-liA13glll) is indicated schematically by the folding diagram of hGH replacing 
the arrow head. 

FIGURE 2. Immunobbtofwhole phage particles shows tiiathGHcomigrates with phage. Phagemid 

20 particles purified in a cesium chloride gradient were loaded into duplicate wells and electrophoresed through a 1% 
agarose gel in 375 mM Tris. 40 mM glycine pH 9.6 buffer. The gel was soaked in transfer buffer (25 mM Tris, pH 
8.3, 200 mM glycine, 20% methanol) containing 2% SDS and 2% p-mercaptoethanol for 2 hours, then rinsed in 
transfer buffer for 6 hours. The proteins in tiie gel were then electrobiotted onto immobilon membranes 
(Miliipore). The membrane containing one setofsamples was stained wittiCoomassie blue to showthe position of 

25 the phage proteins (A). The duplicate membrane was knmuno-stsuned for hGH by reacting the membrane with 
polyclonal rabbit anti-hGH antibodies followed by reaction with horseradish peroxidase conjugated goat anfi- 
labbit IgG antibodies (B). l.ane 1 contains flie M13K07 parent phage and is visible only in the Coomassfe blue 
stained membrane, since it lacks hGR Lanes 2 and 3 contain separate preparations of the honnone phagemid 
particles which Is visible both by Coomassle and hGH immuno-stafning. The difference in migration distance 

30 between the parent Mt3K07 phage and honnone phagemid particles reflects the different size genomes that are 
packaged witiiin (8.7 kb vs. 5.1 kb, respectively). 

FIGURE 3. Summary diagram of steps in the selection process for an hGH-phage library randomized at 
codons 172, 174, 176, and 178. The template molecules, pH0415, containing a unique Kpnl restriction site and the 
hGH(R1 780,11 79T) gene was mutagenized as described in the text and electrofransfomied into £ coli strain 

35 WJM101 to obtain the initial phagemid library, Library 1. An aliquot (approximately 2%) from Library 1 was used 
directly in an initial selection round as described in the text to yield Library 1G. Meanwhile, double-stranded DNA 
(dsDNA) was prepared from Library I, digested with restriction enzyme Kpnl to eliminate template background, 
and electrotransformed Into WJM101 to yield Library 2. Subsequent rounds of selection (or Kpnl digestion, 
shaded boxes) followed by phagemid propagation were carried out as indicated by the arrows, according to the 
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procedure described in the text. Four independent clones from Library 4G^ and four independent clones from 
Library 5G^ were sequenced by dideoxy sequencing. All of these clones had the identical DNA sequence, 
corresponding to the hGH mutant (Glu 174 Ser, Phe 176 Tyr). 

FIGURE 4. Structural model of hGH derived from a 2.8 A folding diagram of porcine growth hormone 
5 detemiined crystallographicaily. Location of residues in hGH that strongly modulate Its binding to the hGH- 
binding protein are within the shaded circle. Alanine substitutions that cause a greater than tenfold 
reduction(9), a four- to tenfold reduction {•), or Increase (O), or a two- to fourfold reduction (•), in binding 
affinity are indicated. Helical wheel projections in the regions of a-helix reveal their amphipalhic quality. 
Blackened, shaded, or nonshaded residues are charged, polar, or nonpoiar, respectively, h helix-4 the most 

1 0 important residues for mutation are on the hydrophilic face. 

FIGURE 5. Amino acid substitutions at positions 172, 1 74, 1 76 and 1 78 of hGH (The notation, e.g. 
KSYR, denotes hGH mutant 172K/174S/176Y/178R.) found after sequencing a number of clones from rounds 1 
and 3 of the selection process for the pathways indicated (hGH elution; Glycine elution; or Glycine elution after 
pre-adsorption). Non-functional sequences (i.e. vector background, or other prematurely terminated and/or 

1 5 frame-shifted mutants) are shown as 'NP. Functional sequences which contained a non-silent, spurious mutation 
(i.e. outside the set of target residues) are marked with a "+". Protein sequences which appeared more than once 
among ail the sequenced clones, but with different DNA sequences, are marked with a "#". Protein sequences 
whk:h appeared more than once among the sequenced ctones and with the same DNA sequence are marked with a 
Uoie that after three rounds of selection, 2 different contaminating sequences were found; these clones did 

20 not con^espond to cassette mutants, but to previously constnKted hormone phage. The pS0643 contaminant 
corresponds to wild-type hGH-phage (hGH "KEFR"). The pH0457 contaminant, which dominates the third- 
round glycine-selected pool of phage, corresponds lo a previously identified mutant of hGH, "KSYR" The 
ampllficatton of these contarrtinants emphasizes the ability of the hormone-phage selection process to select for 
rarely occuning mutants. The convergence of sequences is also striking in all three pathways: R or K occurs most 

25 often at positions 172 and 1 78; Y or F occurs most often at position 176; and S, T, A, and other residues occur at 
posltton 174. 

FIGURE & Sequences from phage selected on hPRLbp-beads In the presence of zinc. The notation is 
as described in Figure. 5. Here, the convergence of sequences is not predictable, but there appears to be a bias 
towards hydrophobic sequences under the most stringent (Glycine) selection conditions; L ,W and P reskiues are 
3 0 frequently found in this pool. 

FIGURE 7. Sequences from phage selected on hPRLbp-beads in the absence of zinc. The notation is as 
described in Figure 5. In contrast to the sequences of Figure. 6, these sequences appear more hydrophilic. After 
4 rounds of selection using hGH elution, two cbnes (ANHQ, and TLDT/1 71 V) dominate the pool. 

FIGURES. Sequences from phage selected on blank beads. The notatton is as described in Fig. 5^ After 
35 three rounds of selection with glycine elution, no siblings were obsen/ed and a background level of non-functional 
sequences remained. 

RGURE 9. Construction of phagemid fl on from pH0415. This vector for cassette mutagenesis and 
expression of the hGH-gene III fusion protein was constmcted as follows. Piasmid pS0643 was constructed by 
oligonucleotide-directed mutagenesis of pS0132, which contains pBR322 and fl origins of repltoation and 
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expresses an hGH^ ill fusion protein (hQH residues M91 , followed by a single Gly residue, lused to Pro-198 
of gene III) under ttie conttol of the £Mflt!2& promoter. HAjtagenesis was carried out with liie oligonucleotide 
5'-GGC-AGC-TGT-GGC-TEJJ[&fi:AGT-GGC-GGC-GGC-TCT-GGT-3*. which introduced a ifljalsite 

(underftied) and an amber stop codon (TAG) following Phe-191 of hGH. 
5 FIGURE ia A. Diagram of plasmid pDH188 insert containing the DNA encoding the light chain and 

heavy chain (variable and constant domain 1 ) of the Fab humanized antibody directed to the HER-2 receptor. Vl 
and Vh are the variable regions for the light and heavy chains, respectively. Ck is the conslattt region of tfie human 
lappa light chain. CHIqi is the first constant region of the human gamma 1 chain. Both coding regions start with 
the bacterial st II signal sequence. B. A schematic diagram of the entire plasma pDH188 contairtfng the Insert 
1 0 described in 5A. After transformation of the plasmid Into E co«f SR101 eels and the addffion of helper phage, 
the plasmid is packaged into phage particles. Some of these particles display the Fab-p ilf fusion (where p III is 
the prote'm encoded by the M13 gene III DNA). Tfie segments in the plasmid figure correspond to the Insert shown 
inSA. 

FIGURE llAttHtxigh C are coleGiively lefened to here as Figure ii. The nucleotide (Seq. ID No. 25) 
15 sequenceoftheDNAenoodingihe4D5Fabmoleculeexpressedcnthephagemldsuriace. The amino acid sequence 
of the light chainis also shown (Seq. ID No. 26). as Is the amino add setfjence of the heavy chain p III fusion (Seq. ID 
No. 27). 

FIGURE 12. Enri(*iment of wild-type 4D5 Fab phagemid from variant Fab phagemW. Mixtures of wild- 
type phagemid and variant 4D5 Fab phagemW in a ratio of 1:1 .000 were selecledon plates coated with the extra- 

20 c^lubr domain protein of the HER-2 receptor. After each round of selectton, a portion of the eluted phagemid 
were infected Info £ coffand plasmid DNA vras prepared. This plasmid DNA was then digested with Eco RV and 
Pst I. separated on a 5% polyacoriamide gel, and stained virith ethidium bromide. The bands were visualized under 
UV light. The bands due to the wild-type and variant plasmids are mariced witti arrows. The firet round of 
selection vras eluted only under acid conditions; subsequent rowids were etutediwilh either an add elirf on (left 

25 side of Rgure) or with a humanized 4D5 antibody wash step prior to acid elution (right side of Rgure) using 
methods described in Example m Three variant 4D5 Fab molecules were made: H91A(aminoaddWstidiheal 
position 91 on the Vl chain mutated to alanine; indicated as W lanes In Rgure), Y49A (amino add tyrosine at 
posiBon 49 on the Vl diain mutated to alanine; indicated as B' lanes in the Rgure), and Y92A (amino add ^sine 
at position 92 on the Vl chain mutated to alanhe; indicated as -C' lanes in the Figure). Amino add position 

30 numbering is according to Kabat ef sS^Secfimces dprotBbs ofmmmotes^ interest, 4th ed.. U.S. Dept of 
Health and Human Senrices. PubTic Health Service, NarLtnsiitute of Health. Bethesda, MD [1987)). 

FIGURE 13. The Scatchard analysis of the RIA affinity detemiination described in Experimental 
Protocols is shown here. The amount of labeled ECD antigen that is bound is shown on the x-axis while the amount 
that is bound divided by the amount that is free is shown on the y-axis. The stope of the yne incficates the Ka; the 

35 calculated Kd is 1/Ka- 
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DETAILED DESCRIPTION OF THE INVENTION 

The following discussion will be best understood by referring to Figure 1 . In its simplest form, the 
method of the instant invention comprises a method for selecting novel binding polypeptides, such as protein 
ligands, having a desired, usually high, affinity for a target molecule from a library of structurally related binding 
5 polypeptides. The library of structurally related polypeptides, fused to a phage coat protein, is produced by 
mutagenesis and, preferably, a single copy of each related polypeptide is displayed on the surface of a phagemid 
particle containing DNA erxxxiing that polypeptide. These phagemid particles are then contacted with a target 
molecule and those particles having the highest affinity lor the target are separated from those of lower affinity. 
The high affinity binders are then amplified by infection of a bacterial host and the competitive binding step is 
1 0 repeated. This process is reiterated until polypeptides of the desired affinity are obtained. 

The novel binding polypeptides or ligands produced by the method of this invenlion are useful per se as 
diagnostics or therapeutics ( eg. agonists or antagonists) used in treatment of biological organisms. Structural 
analysis of the selected polypeptides may also be used to fadOtate rational drug design. 

By 'binding polypeptide" as used herein is meant any polypeptide that binds with a selectable affinity to 
1 5 a target molecule. Preferably the polypeptide will be a protein that most preferably contains more than about 
100 amino add residues. Typically the polypeptide will be a hormone or an antibody or a fragment thereof. 

By "high affinity" as used herein Is meant an affinity constant (K(j ) of <10*5 M and preferably <10"7m 
under physiological conditions. 

By "target molecule" as used herein is meant any molecule, not necessarily a protein, for which it is 
20 desirable to produce a ligand. Preferably, however, the target will be a protein and most preferably the target 
will be a receptor, such as a hormone receptor. 

By *humanized antibody" as used herein is meant an antibody bfi which the compiementarity-detennlning 
regions (CDRs) of a mouse or other non-human antibody are grafted onto a human antibody frameworic. By human 
antibody frameworic is meant the entire human antibody exduding the CDRs. 

25 L Choice of Potvoeptides for DisDiav on the Surface of a Phage 

The first step in the method of this invention is to choose a polypeptide having rigid secondary 
structure exposed to the surface of the polypeptide for display on the surface of a phage. 

By "polypeptide" as used herein is meant any molecule whose expression can be directed by a spedfic 
DNA sequence. The polypeptides of this invention may comprise more than one subunit, where each subunit is 
3 0 encoded by a separate DNA sequence. 

By "rigid secondary structure' as used herein is meant any polypeptide segment exhibiting a regular 
repeated structure such as is found in; a-helices. 3io helices, ic-helices, parallel and antiparallel p-sheets, and 
reverse turns. Certain "non-ordered" structures that lack recognizable geometric order are also induded in the 
definition of rigid secondary structure provided they form a domain or "patch" of amino acid residues capable of 
35 interaction with a target and that the overall shape of the structure is not destroyed by replacement of an amino 
add within the structure . It is believed that some non-ordered structures are combinations of reverse turns. The 
geometry of these rigid secondary structures is well defined by <{» and y torsional angles about the a-carbons of 
the peptide "backbone". 
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The requirement that the secondaiy structure be exposed ta the surface of the polypeptide Is to 
provide a domain or "patch" of amino acki residues that can be exposed to and bind with a target molecule. It is 
primarily these amino add residues that are replaced by mutagenesis that form the "library" of structurally 
related (mirtant) binding polypeptides that are displayed on the surface of the phage and from which novel 
5 polypeptide ligands are selected. Mutagenesis or replacement of amino acid residues directed toward the Interior 
of the polypeptide is generally avoided so that the overall stnjcture of the rigid secondary structure is presented. 
Some replacement of amino acids on the interior region of the rigid secondary structures, especially with 
hydrophobic amino acid residues, may be tolerated since these conservative substitutions are unlikely to distort 
the overall structure of tiie polypeptide. 

1 0 Repeated cycles of "polypeptide" selection are used to select for higher and higher affinity binding by 

the phagemid selection of multiple amino add dianges whidi are selected by multiple selection cydes. Fbllowng a 
first round of phagemid selection. Involving a first region or selection of amino adds in the ligand polypeptide, 
additional roundsof phagemH selection In oflier regions or amino adds of the Ogand polypeptide are conducted. 
The cydes of phagemid selection are repeated until tiie dedred afffnity properties of the ligand polypeptide are 

15 achieved. To illustrate Ws process. Example Vllf phagemid selection of hGHw^ Infliefirst 
cyde hGH amino adds 172, 174, 17B and 1 78 were mutated and phagemid selected. In a second cycle hGH amino 
adcfe 167, 171, 175 and 179 were phagemid selected. In a third hGH amino adds 10. 14, 18 and 21 were 
phagemid seleded. Optimum amino add changes from a previous cycle may be incorporated into tiie polypeptide 
before the next cyde of selection. For example, hGH amino adds substitution 174 (serine) and 176 (tyrosine) 

20 were incorporated into ttie hGH before the phagemid selection of hGH amino adds 167, 1 71 , 1 ^ and 1 79. 

From ttie forgoing it wilt be appreciated that the amino add residues tiiat fonn the binding domain of 
the polypeptide will not be sequentially linked and may reside on different subunits of the polypeptide. That is, 
the binding domain tracks with the particular secondary structure at the binding site and not the primary 
structure. Thus, generally, mutations will be introduced into codons encoding amino acWs within a particular 

25 secondary structure at sites directed away from the interior of the polypeptkle so that they will have the 

potential to interact viflh the terget. By way of lllustraton, Rgure 2 shows the location of residues in hGH that 
are known to sftongly modulate Its binding to the hGH-binding protein (Cunningham efa/., SciSDCS 247:1461- 
1465 [1990]). Thus representative sites suitable tor mutagenesis would Indude resklues 172, 174, 176, and 178 
on hellx-4, as well as reddue 64 located In a "non-ordered" secondary struct^^^ 

30 There is no requirement that the polypeptide chosen as a ligand to a target normally Knd to that target. 

Thus, for example, a glycoprotein honnone such as TSH can be chosen as a liga^ 
library of mutant TSH molecules are employed in the raetiiod of this Inventton to produce novel dmg candidates. 

TWs invention thus contemplates any polypeptide that binds to a target molecule, and indudes 
antibodies. Preferred polypeptides are those that have pharmaceutfcal utility. More preferred polypeptides 

35 indude; a growtii hormone, induding human growth homione, des-N-mettikxiyl human growth hormone, and bovine 
growth hormone; parafliyrokl hormone; thyroM stimulating homione; tiiyroxine; insulin A-chain; insulin B-diain; 
proinsulin; follicle stimulating hormone; calcitonin; leutinizing hormone; glucagon; factor Vill; an antibody; lung 
surfactant; a plasminogen activator, such as urokinase or human tissue-type plasminogen activator (t-PA); 
bombesin; factor IX, thrombin; hemopoiette growth factor; tumor necrosis factor-alpha and -beta; enkephaiinase; a 
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serum albumin such as human serum albumin; mUlerian-inhibiting substance; relaxin A-chain; relaxin B-chain; 
prorelaxin; mouse gonadotropin-associated peptide; a microbiai protein, such as betaiactamase; tissue factor 
protein; inhibin; activin; vascular endothelial growth factor; receptors for honnones or growth factors; integrin; 
thrombopoietin; protein A or D; rheumatoid factors; nerve growth factor such as NGF-P; platelet-derived growth 
5 factor; fibroblast growth factor such as aFGF and bPGF; epidennal growth factor; transforming growth factor 
(TGF) such as TGF-alpha and TQF43eta; insuOn-iike growth feK^tor-l and -II; insulin-like growth factor binding 
proteins; CD-4; DNase; latency associated peptide; erythropoietin; osteoinductive factors; an interferon such as 
interferon-alpha, -beta, and -gamma; colony stimulating factors (CSFs), e.g., M-CSF, GM-CSF, and G-CSF; 
interleukins (ILs), e.g., iL-1, IL-2, IL-3, IL-4, etc.; superoxide dismutase; decay accelerating factor; atrial 

1 0 natriuretic peptides A, B or C; viral antigen such as, for example* a portion of the HIV envelope; immunoglobulins; 
and fragments of any of the above-listed polypeptides. In addition, one or more predetermined amino add 
residues on the polypeptide may be substituted, inserted, or deleted, for example, to produce products with 
improved biological properties. Further, fragments of these polypeptides, especially bidogicalty active 
fragments, are Included. Yet more preferred polypeptides of this invention are human growth homione , and 

1 5 atrial naturetic peptides A, B, and C, endotoxin, subtilisin, trypsin and other serine proteases. 

Still more preferred are polypeptide hormones that can be defined as any amino acid sequence produced 
in a first cell that binds specifically to a receptor on the same cell type (autocrine hormones) or a second cell type 
(non-autocrine) and causes a physiobgical response characteristic of the receptor-bearing cell. Among such 
polypeptide hormones are cytokines, lymphokines, neurotrophic hormones and adenohypophyseal polypeplkie 

2 0 homiones such as growth hormone, prolactin, placental lactogen, luteinizing hormone, fbllk^ie-stimulating hormone, 

thyrotropin, chortonic gonadotropin, corticotropin, a or p-melanocyte-slimulating hormone, p-lipolropin, y- 
lipotropin and the endorphins; hypothalmk: release-inhibiting hormones such as corticotropin-reiease factor, 
growth honnone release-Inhibiting hormone, growth hormone-release facton and other polypeptUe hormones such 
as atrial natriuretic peptides A, B or C. 
25 B. Qfatalhing a Rist Gone fGene 1^ encoding the desired poiynenUde 

The gene encoding the desired polypeptMe (i.e., a polypeptkle with a rigkl secondary structure) can be 
obtained by methods known in the art (see generally, Sambrook et al. , Molecular BiQtogy: A LabQratQfy Manual. 
Cold Spring Harbor Press, Cold Spring IHarbor, New York [1989]). If the sequence of the gene is known, the 
DNA encoding the gene may be chemteally synthesized {MerrfiekJ, J. Am. Chem. Soc. as 2149 [19631). » the 

3 0 sequence of the gene is not known, or if the gene has not prevtously been isolated, it may be cloned from a cDNA 

library (made from RN A obtained from a suitable tissue in which the desired gene Is expressed) or from a suitable 
genomic DNA library. The gene is then isolated using an appropriate probe. For cDNA libraries, suitable probes 
include monoclonal or polyclonal antibodies (provWed that the cDNA library is an expression library), 
oligonucleotides, and complementary or homologous cDNAs or fragments thereof. The probes that may be used to 
35 isolate the gene of interest from genomic DNA libraries include cDNAs or fragments thereof that encode the same 
or a similar gene, homok)gous genomte DNAs or DNA fragments, and oligonudeotkies. Screening the cDNA or 
genomic library with the selected probe is conducted using standard procedures as described in chapters 10-12 
of Sambrook ef a/., supra. 
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An alternative means to fsolating the gene encoding Hie protein of interest Is to use polymerase chain 
reaction methodology (PGR) as described in section 14 of Sambrook ef a/., supra. This method requires the use 
of oligonucleotides that will hybridize to the gene of interest; ftus, at least some of the DNA sequence for this 
gene must be known in order to generate the oligonucleofdes. 
5 After the gene has been isolated, it may be Inserted into a suitable vector (preferably a piasmid) for 

amplification, as described generally in Sambrook etaL, supra. 
BL f,r^i<rtTuctinff RepjirflMP gYpresslon Veclore 

While several types of vectors are available and may be used to practice this invention, piasmid vectors 
are the preferred vectors for use herein, as they may be constructed with relative ease, and can be readily 
1 0 amplified. Piasmid vectors generally contain a variety of components including promoters, signal sequences, 
phenotypic selection genes, origin of replicatfon sites, and other necessary componente as are known to those of 
ordinary skill in the art. 

Promoters most commonly used in prokaryotic vectors include theMZpromoter system, the alkaline 
phosphatase Bto A promoter, the bacteriophage XPL promoter (a temperature sensitive promoter), the lac 

15 promoter (a hybrid fiBrJac promoter that is regulated by the lac repressor), the tryptophan promoter, and the 
bacteriophage T7 promoter. For general descripttonsof promoters, see section 17 of Sambrook et al. supra . 
WhOe these are the most common^ used promoters, other suitable microlxal promoters may be used as well. 

Prefened promoters for pracficing this invention are those that can be t^htly regulated such that 
expresston of the fusion gene can be controlled. It is believed that the problem that went unrecognized in the 

20 prior art was that display of multiple copies of the fusion protein on the surface of the phagemid particle lead to 
multipoint attachment of the phagemid with the target It is believed this effect, referred to as the "chelate 
effect", results in selection of false "high affinity" polypeptides when multiple copies of the fusion protein are 
displayed on the phagemid particle in close proximity to one another so that the target was "chelated". When 
multipoint attachment occurs, the effecfive or apparent Kd may be as high as the product of the indivBual Kds 

25 for each copy of the displayed fusfon protein. This effect may be the reason Cwiria and cowortcers supra were 
unable to separate moderate affinity peptides from higher affinity peptides. 

It has been discovered that by tightly regutating expresston of the fusfon protein so that no more than a 
minor amount, i.e. fevrer than about 1%, of the phagemid particles contain multiple copies of the fusfon protein the 
-chelate elfecT is overcome aitowing proper selection of high affinity polypeptides. Thus, depending on the 

30 promoter, culturing conditions of the host are adjusted to maximize the number of phagemid particles confaining a 
singte copy of the fusion protein and minimize the number of phagemid partteles oonteining multiple copies of the 
fusion protein. 

Preferred promoters used to praclfce this invention are the lac Z promoter and the cto A promoter. 
The lac Z promoter is regulated by the lac repressor protein lac i, and thus transcription of the fusion gene can be 
35 controlled by manipulation of the level of the lac repressor protein. By way of illustratfon, the phagemid containing 
the lac Z promotor Is grown in a ce« strain that contejns a copy of the lac i repressor gene, a repressor tor the 
lacZ promotor. Exemplary cell strains containing the laci gene include JM 101 and XL1 -blue. In the alternative, 
the host cell can be colransfected with a piasmid containing both the repressor lac i and the lacZ promotor. 
Occasionally both of the above techniques are used simultaneously, that is, phagmide partfoles containing the lacZ 
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promoter are grown in cell strains containing the lac i gene and the cell strains are cotransfected with a piasmid 
containing both the Jac Z and lac i genes. Nomnaiiy when one wishes to express a gene, to the transfected host 
above one would add an inducer such as isopropylthiogalactoside (IPTG). In the present invention however, tNs 
step is omitted to (a) minimize the expression of the gene III fusion protein thereby minimizing the copy number 
5 (i.e. the number of gene 111 fusions per phagemid number) and to (b) prevent poor or improper packaging of the 
phagemid caused by inducers such as IPTG even at low concentrations. Typically, when no inducer is added, ttie 
number of fusion proteins per phagemid particle is about 0.1 (number of bullc fusion proteins/number of phagemid 
particles). The most prefened promoter used to practice this invention Is obfi A. This promoter is believed to be 
regulated by the level of inorganic phosphate In the cell where the phosphate acts to down-regulate the activity of 

10 the promoter. Thus, by depleting cells of phosphate, the activity of the promoter can be increased. The desired 
result is achieved by growing cells in a phosphate enriched medium such as 2YT or LB thereby controlling the 
expression of the gene III fusion. 

One other useful component of vectors used to practice this invention is a signal sequence. This sequence 
is typically located immediately 5' to the gene encoding the fusion protein, and will thus be transaibed at the amino 

1 5 tenninus of the fusion protein. However, in certain cases, the signal sequence has been demonstrated to be located 
at positions other 5' to the gene encoding the protein to be secreted. This sequence targets the protein to which 
it is attached across the inner membrane of the bacterial cell. The DNA encoding the signal sequence may be 
obtained as a restriction endonudease fragment from any gene encoding a protein that has a signal sequence. 
Suitable prokaryotic signal sequences may be obtained from genes encoding, for example, LamB or OmpF (Wong 

20 et al.. Gene . 68:193 [1983]), MalE, PhoA and other genes. A prefened prokaryotic signal sequence for practicing 
this invention is the £ coll heat-stable enterotoxin II (STII) signal sequence as described by Chang et al, , £sd&, 
55: 189 [1987]. 

Another useful component of the vectors used to practk» this invention is phenotypk: selection genes. 
Typical phenotypic selection genes are those encoding proteins that confer antibiotic resistance upon the host cell. 
25 By way of illustratton, the amptollRn resistance gene (aoiB), and the tetracycline resistance gene (tat) are readily 
emptoyed for this purpose. 

Constmction of suitable vectors comprising the aforementioned components as well as the gene encoding 
the desired polypeptide (gene 1) are prepared using standard recombinant DNA procedures as described in 
Sambrook era/, supra. Isolated DNA fragments to be combined to fomi the vector are cleaved, tailored^ and 
30 ligated together in a spedfic order and orientation to generate the desired vector. 

The DNA is deaved using the appropriate restriction enzyme or enzymes in a suitable buffer. In general, 
about 0.2-1 \ig of piasmid or DNA fragments is used with about 1-2 units of the appropriate restriction enzyme 
in about 20 ^1 of buffer solution. Appropriate buffers, DNA concentrations, and incubation times and 
temperatures are spedfied by the manufacturers of the restriction enzymes. Generally, incubation times of about 
35 one or two hours at 37*C are adequate, although several enzymes require higher temperatures. After incubation, 
the enzymes and other contaminants are removed by extraction of the digestion solution with a mixture of phenol 
and chlorofomn, and the DNA is recovered from the aqueous fraction by precipitation with ethanol. 

To ligate the DNA fragments together to fonn a functional vector, the ends of the DNA fragments 
must be compatible with each other, in some cases, the ends will be directly compatible after endonudease 
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digesfion. However, it may be necessaiy to first convert the sticky erxte commonly produced by endonuclease 
digestion to blunt ends to make Ihera compatibfe for Bgalion. To Kurt the ends, the DNA is treated In a suitable 
buffer for at least 15 minutes at 15'C with 10 units of of the Klenow fragment of DNA polymerase I (Kienow) in 
the presence of the four deoxynucleotide triphosphates. The DNA is then purified by phenol-chlorofonn 
5 extraction and ethane! preapi^tion. 

The cleaved DNA fragments may be size-separated and selected using DNA gel electrophoresis. The 
DNA may be electrophoresed ttirough either an agarose or a polyacryiamide matbc. The selection of the matrix 
will depend on the size of the DNA fragments to be separated. After electrophoresis, the DNA is extracted from 
the matrix by electroelution, or, if low-melting agarose has been used as the matrix, by meWng the agarose and 

1 0 extracting the DNA from it, as described In secfions e.30-6.33 of Samfarook et aL, supra. 

The DNA fragments that are to be ligated together (previously digested with the appropriate 
restriction enzymes such that the ends of each fragmert to be ligated are compatible) are put in solution in about 
equimolar amourts. The solution wiU also contain ATP, Ggase buffer and a ligase such as T4 DNA ligase at about 
10 units per 0£ jig of DNA. If the DNA fragment is to be ligated into a vector, the vector is at first linearized by 

1 5 cutting with the appropriate restriction endonuclease(s). The linearized vector is then treated with alkaline 
phosphatase or calf intestinal phosphatase. The phosphatasing prevents self^igatibn of the vector during the 
ligatton step. 

After iigafion, the vector with the foreign gene now inserted is transfonned into a suitable host cell. 
Prokaryotes are the preferred host cells for this invenHon. Suitable prokaryotic host cells include £ coli strain 

20 JM101, £ coli K12 strain 294 (ATCC number 31.446). £ coB strain W31 10 (ATCC number 27.325), £ coli 
X1776(ATCC number 31.537). £co// XL-1 Blue (stratagene), and £ co/; B; however many other slra&is of £ 
CO//, such as HB101, NIW522, NM538. NM539. and many other spedes and genera of prokaryotes maybe used as 
well. In addition to the £ coli strains listed above, bacilli such as BacBIUS SUt?tiliS. other enterobacteriaceae such as 
.«?aimonfliiatvDMmurium nr Rfirratia marcesans. and various EssudolDQlBS spedes mayal be used as hosts. 

25 Transfonnation of prokaryoticcells is teadByaccompllshedusing the calcium chloride method as 

described in sectfon 1.82 of Sambfookefat, supra. Alternatively, eIec&Dporation(Netmann efai,£iaB£liL 
l«4i [1982]) msf be used to tiansfonn these cells. The transfonned eels are setected by growth on an 
anfiWoflc, common^ tetracydine (tet) or ampk:ilin <amp), to whteh they are rendered resistart due to the 
presence of tet andtor amp resistanoe genes on the vector. 

30 Mer selection of the transformed cells, these cells are grown to culture and the plasmW DNA (or other 

vector with the foragn gene fiserted) is then isolated. PlasmW DNA can be isolated using methods known in the 
art. Two suitable methods are the small scate preparation of DNA and the large-scale preparation of DNA as 
described in sectnns 1.25-1 J3 of Sambrook et af., supra. The isolated DNA can be purified by methods known in 
the fflt such as that described in section 1 .40 of Sambrook etal.. supra. This purified plasmid DNA is then 

35 analyzed by restrictfcin mapping and/or DNA sequendng. DNA sequendng is generally performed by either the 
methodofHtessingefa/. n. trif^. Adds Res.. 9309 119811 or bv the method of Maxam etal. Msth.Enzymol,. 65: 
499 [1980]. 
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IV. fipfyRRlnn 

This invention contemplates fusing the gene enclosing the desired polypeptide (gene 1 ) to a second gene 
(gene 2) such that a fusion protein Is generated during transcription. Gene 2 is typically a coat protein gene of a 
phage, and preferably It is the phage M13 gene III coat protein, or a fragment thereof. Fusion of genes 1 and 2 may 
5 be accomplished by inserting gene 2 into a particular site on a plasmid that contains gene 1 , or by inserting gene 1 
into a particular site on a plasmid that contains gene 2. 

Insertton of a gene into a plasmid requires that the plasmid be cut at the precise location that the gene is 
to be inserted. Thus, there must be a restriction endonudease site at this location (preferably a unique site such 
that the plasmid will only be cut at a single location during restriction erxionuclease digestion). The plasmid is 

1 0 digested, phosphatased, and purified as described above. The gene is then inserted into this linearized plasmid by 
llgating the two DNAs together. Ligation can be accomplished if the ends of the plasmid are compatible with the 
ends of the gene to be inserted. If the restriction enzymes are used to cut the plasmid and isolate the gene to be 
inserted create blunt ends or compatible sticky ends, the DNAs can be llgated together directly using a llgase 
such as bacteriophage T4 DNA llgase and incubating the mixture at 16'C for 1-4 hours In the presence of ATP 

15 and ligase buffer as described in section 1 .68 of Sambrook et al., suQia- K the ends are not compatible, they must 
first be made blunt by using the Klenow fragment of DNA polymerase I or bacteriophage T4 DNA polymerase, 
both of which require the four deoxyribonudeotlde triphosphates to fill-in overhanging single-stranded ends of 
the digested DNA. Altematively, the ends may be blunted using a nuclease such as nuclease SI or mung-bean 
nuclease, both of which function by cutting back the overhanging single strands of DNA. The DNA is then 

2 0 religated using a ligase as described above. In some cases, It may not be possible to blunt the ends of the gene to 

be inserted, as the reading frame of the coding regton will be altered. To overcome this problem, oligonucleotide 
linkers may be used. The linkers sew as a brklge to connect the plasmid to the gene to be inserted. These linkers 
can be made synthetk^aliy as double stranded or single strarKled DNA using standard methods. The linkers have 
one end that is compatible with the ends of the gene to be inserted; the linkers are first llgated to ttits gene using 

25 ligation methods described above. The other end of the linkers is designed to be compatible with the plasmkJ for 
llgatton. In designing the linkers, care must be taken to not destroy the reading frame of the gene to be Inserted 
or the reading frame of the gene contained on the plasmid. In some cases, it may be necessary to design the linkers 
such that they code for part of an amino acid, or such that they code for one or more amino acids. 

Between gene 1 and gene 2, DNA encoding a termination codon may be inserted, such termination codons 

30 are UAG( amber), UAA (ocher) and UGA (opel). (Microbtotogy, Davis et al. Harper & Row, New York, 1980, 

pages 237, 245-47 and 274). The terminatton codon expressed in a wild type host cell results in the synthesis of 
the gene 1 protein product without the gene 2 protein attached. However, growth in a suppressor host cell 
results in the synthesis of detectable quantities of fused protein. Such suppressor host cells contain a tRNA 
modified to insert an amino acid in the tenninatton codon positton of the mRNA thereby resulting in production of 

3 5 detectible amounts of the fusion protein. Such suppressor host cells are well known and described, such as Eco// 

suppressor strain (Bullock et al., BioTechniques 5, 376-379 [1987]). Any acceptable method may be used to 
place such a termination codon into the mRNA encoding the fusion polypeptide. 

The suppressiUe codon may be inserted between the first gene encoding a polypeptide, and a second 
gene encoding at least a portton of a phage coat protein. Altematively, the suppressiUe temiination codon may be 
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inserted atljacent to !he fawn site by replacing the last amino acid triplet In the polypeptide or the first amino 
add fn the phage coatprotein. Whai the phagemid containing the suppresslWecodonfe 
host ceH, it results m the detectable production of a fusion polypeptide containing the polypeptide and the coat 
protein. When the phagemid is grown in a non-suppressor host cell, the polypeptide is synthesized substantially 
5 without fusion to the phage coat protein due to termination at the inserted suppressiWe triplet encoding UAG, 
UAA, or UGA. In the non-suppressor cell the polypeptide is synthesized and secreted from the host cell due to the 
absence of the fused phage coat protein which otherorise anchored it to the host cell. 
V. Aiteratjonrmutatin nv of Gene 1 at Selected Positions 

Gene 1 . encoding the desired polypeptide, may be altered at one ormore selected codons. An alteration 
10 is defined as a substitution, deletion, or insertion of one or more codons in the gene encoding the polypeptide that 
results in a change in the amino add sequence of the polypeptide as compared with the unaltered or native 
sequence of the same polypeptide. Preferably, the alterations wiB be by substitution of at least one amino add 
with any other amino addfn one or more regions of the molecute. The alterations may be produced be a variety of 
methods known in the art These methods indude but are not limited to ollgonucleofide-mediated mutagenesis and 

15 cassette mutagenesis. 

A. aigonudeotMe^latedMitagenesIs 

Oligonudeotide -mediated mutagenesis is preferred method for preparing substituton, deletion, and 
insertion variants of gene 1- Thistechnique is well known in the art as described by Zoller et al. Nucleic Adds Res. 
Jfl: 6487-6504 [1987]. Briefly, gene 1 is altered by hybridizing an oligonudeotide encoding the desired mutatton 
20 to a DNA template, where the template is the single-stranded form of the plasmid conteining the unaltered or 
natVe DNA sequence of gene 1 . After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template will thus incorporate the oiigonudeotWe primer, and will code for the 
selected alteration in gene 1 « 

Generally, oligonudeotkles of at least 25 nucleotides in length are used An optimal oligonudeotide will 
25 have 1 2 to 15 nudeotdes that are completely complementery to the template on either sideof the nudeotide(s) 
coding for the mutefibn. This ensures that the oligonudeotide will hybridize property to the single-stranded DNA 
template mdecule. The olgonudeotides are readily synthesized using techniques known in the art such as that 
described by Crea etal. Pm^ Mafi- Acad. Sd. usA. 75: 5765 [1978]. 

The DNA template can only be generated by those vectors that are either derived from bacteriophage 
3 0 MT3 vecters (Bie commerdally available M13mpt8 and M13mp19 vectors are suitable), or those vectors that 
contain a single-stranded phage origin of replication as described by Viera etai Metfl, EnzvmQln 153: 3 [1987]. 
Thus, the DNA that is to be mutated must be inserted into one of these vectors in order to generate single- 
stranded template. Production of the single-stranded template is described in sections 4.21-4.41 of Sambrook 
etal., supra. 

35 To alter the native DNA sequence, the oligonudeotide is hybridized to the single stranded template 

under sulteble hybridization conditions. A DNA polymerizing enzyme, usually the Kienow fragment of DNA 
polymerase I, is then added to synthesize the complementary strand of the template using the oligonucleotide as a 
primer for synthesis. A heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated 
form of gene 1 . and the other strand (the original template) encodes the native, unaltered sequence of gene 1 . 



wo 92/09690 



17 



PCr/US91/09133 



This heteroduplex molecule is then transformed into a suitable host cell, usually a prokaiyote such as £ Coll 
JM101 . After growing the cells, they are plated onto agarose plates and screened using the oligonucleotide primer 
radiolatelled with 32-Phosphate to identify the bacterial colonies that contain the mutated DNA. 

Tlie method described immediately above may be modified such that a homoduplex molecule is created 
5 wherein both strands of the piasmid contain the mutation(s). The modifications are as follows: The single- 
stranded oEgonudeotide is annealed to the single-stranded template as described above. A mixture of three 
deoxyribonudeotides, deoxyriboadenoslne (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), 
is combined with a modified thio-deoxyribocyfoslne called dCTP-(aS) (which can be obtained from Amersham). 
This mixture is added to the template-oligonucleotide complex. Upon addition of DNA polymerase to this mixture. 

10 a strand of DNA identical to the template except for the mutated bases is generated. In addition, this new strand 
of DNA will contain dCTP-(aS) instead of dCTP, which serves to protect it from restriction endonuclease 
digestion. After the template strand of the double-stranded heteroduplex is nicked with an appropriate 
restriction enzyme, the template strand can be digested with Exolll nuclease or another appropriate nuclease past 
the region that contains the site(s) to be mutagenized. The reaction is then stopped to leave a molecule that is 

1 5 only partially single-stranded. A complete double-stranded DNA homoduplex is then formed using DNA 

polymerase in the presence of all four deoxyribonucleotide triphosphates, ATP, and DNA llgase. This homoduplex 
molecule can then be transformed into a suitable host cell such as £ coll JM1 01 , as described above. 

Mutants with more than one amino acid to be substituted may be generated in one of several ways. If the 
amino acids are located dose together in the polypeptide chain, they may be mutated simultaneously using one 

20 oligonudeotide that codes for all of the desired amino acid substitutions. If, however, the amino acids are located 
some distance from each other (separated by more than about ten amino adds), it is more difficult to generate a 
single digonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be 
empfoyed. 

In the first method, a separate oligonudeotide is generated for each amino add to be substituted. The 
25 oligonudeotides are then annealed to the single-stranded template DNA simultaneously^ and the second strand of 
DNA that is synthesized from the template will encode all of the desired amino add substitutions. The alternative 
method involves two or more rounds of mutegenesis to produce the desired mutant. The first round is as 
described for the single mutants: wild-type DNA is used for the template, an oligonudeotide encoding the first 
desired amino add substitution(s) is annealed to this template, and the heteroduplex DNA molecule Is then 
30 generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of 

mutagenesis as the template. Thus, this template already contains one or more mutatfons. The oligonucleotide 
encoding the additional desired amino acid substitution(s) is then annealed to this template, and the resulting 
strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant DNA 
can be used as a template in a third round of mutagenesis, and so on. 

35 B. Cassette Muiaqengsls 

This method is also a preferred method for preparing substitution, deletion, and insertfon variants of 
gene 1 . The method is based on that described by Wells etal. 34:315 [1985].. The starting material is the 
piasmid (or other vector) comprising gene 1, the gene to be muteted. The codon(s) in gene 1 to be mutated are 
identified. There must be a unique restriction endonudease site on each side of the identified mutation site(s). If 
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no such restriciion sites exist. Ihey may be geneiated using the above-descsribed oligonucteotide-niediated 
mutagenesis method to introduce them at appropriate locations in gene 1. After the restriction sites have been 
introduced into the plasmki, the plasmid Is cut at these sites to linearize it. A double-stranded oligonucleotide 
encoding the sequence of the DNA between the restriction sites but containing the desired mutafeon{s) is 
5 synthesized using standard procedures. The two strands are synthesized separately and then hybridized 
together using standard techniques. This double-stranded oligonudeolide is refened to as the cassette. This 
cassette Is designed to have 3' and 5' ends that are compatible with the ends of the linearized plasmid, such that it 
can be directly ligated to the plasmid. This plasmid now contains the mutated DNA sequence of gene 1 . 
VL Qbtelnlnfl DNA p nmrilfvy the desired protein 

10 in an alternative embodiment, tWs invention contemplates production of variants of a desired protein 

containing one or more subunlts. Each subunitis typically encoded by separate gene. Each gene encotfng each 
subunItcanbeobtainedbymethodslaxiwnintheart{^^^ In some instances, itmaybe 

necessary to obtain the gene encoding the various subunits using separate techniques selected from any of the 
methods described in Section II. 

1 5 When construcBng a replicaWe expression vector where the protein of interest contains more ttian one 

subunit, all subunits can be regulated by the same promoter, typically located 5* to the DNA encoding the subunits. 
or each may be regulated by separate promoter suitably oriented in the vector so that each promoter is operaWy 
linked to the DNA itis intended to regulate . Selection of promoters is carried out as described in Section III 
above. 

20 In constructing a repiicaWe expression vector containing DNA encoding the protein of interest having 

multiple subunits, the reader is referred to Rgure 10 where, by way of illustration, a vector is diagrammed 
showing DNA encoding each subunit of an antibody fragment This figure shows that, generally, one of the 
subunits of the protein of interest will be fused to a phage coat protein such as Mt3 gene III. This gene fusion 
generally will contain ife own signal sequence. A separate gene encodes the other subunit or subunite, and it is 

25 apparentthateach subunit generally has its own signal sequence. Figure 10 also shows that a single promoter can 
regulate the expression of both subunits. Alternatively, each subunit may be Independently regulated by a 
different promoter. The protein of Interest subunit-phage coat protein fusion construct can be made as 
described in Section IV above. 

When constructing a fiamily of variante of the desired multi-subunit protein, DNA encoding each subunit 

30 in the vector may muteted in one or more positions in each subunIL When multi-subunit »itibody variants are 
constructed, preferred sites of mutegenesis correspond to codons encoding amino add residues located in the 
compleraentari^-detennming regions (CDR) of either the light chain, the heavy chain, or both chains. The CDRs 
are commonly referred to as the hypervariaWe regions. Methods for mutagenizlng DNA encoding each subunit of 
the protein of interest are conducted essentiaDy as described in Section V above. 



35 



Target proteins, such as receptors, may be isolated from natural sources or prepared by recombinant 
methods by procedures knovm in the art. Byway of illustration, glycoprotein homione receptors may be prepared 
by the technique described by McFariand efa/.,S£isDffi 245:494-499 [1989), nonglycosylated fomis expressed 
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in £ CO// are described by Fuh et at. J. Biol. Chem 265:31 1 1-31 15 [1990] Other receptors can be prepared by 
standard methods. 

The purified target protein may be attached to a suitable matrix such as agarose beads, acrylamide 
beads, glass beads, cellulose, various acrylic copolymers, hydroxytalkyi methacrylate gels, polyacryiic and 
5 polymethacrylic copolymers, nylon, neutral and tonic carriers, and the like. Attachment of the target protein to the 
matrix may be accomplished by methods described in Methods in Enzymology. 44 [1976], or by other means known 
in the art. 

After attachment of the target protein to the matrix, the immobilized target Is contacted with the 
library of phagemki particles under conditions suitable for binding of at least a portx>n of the phagemki particles 
10 with the immobilized target, ^tormaily, the conditions, including pH, nnc strength, temperature and the like will 
mimto physiotoglcai oondittons. 

Bound phagemid particles ('binders") having high affinity for the immobilized tai^et are separated 
from those having a low affinity (and thus do not bind to the target) by washing. Binders may be dissociated 
from the immobilized target by a variety of methods. These methods include competitive dissociation using the 
1 5 wiU-type llgand, altering plH and/or ionic strength, and methods known In the art. 

Suitable host cells are infected with the binders and helper phage, and the host cells are cultured under 
condittons suitable for amplification of the phagemid particles. The phagemkj particles are then collected and the 
selection process is repeated one or more times until binders having the desired affinity for the target molecule 
are selected. 

2 0 Optionally the library of phagemid particles may be sequentially contacted with more than one 

immobilized target to improve selectivity for a particular target. For example, it is often the case that a llgand 
such as hGH has more than one natural receptor. In the case of hQH, both the growth hormone receptor and the 
prolactin receptor bind the hGH ligand. It may be desirable to improve the selectivity of hGIH for the growth 
homione receptor over the prolactin receptor. This can be acNeved bff first contacting the library of phagemid 

25 particles with immobilized prolactin receptor, eluting those with a tow affinity (i.e. tower than wild type hGH) for 
the prolactin receptor and then contacting the tow affinity prolactin 'binders* or non-binders with the 
immobilized growth hormone receptor, and setocting for high affinity growth hormone receptor binders. In this 
case an hGH mutant having a tower affinity for the prolactin receptor would have therapeutic utility even if the 
affinity for the growth hormone receptor were somewhat tower than that of wild type hGH. This same strategy 

30 may be employed to improve selectivity of a particular hormone or protein for its primary function receptor over 
Its clearance receptor. 

In another embodiment of this Invention, an improved substrate amino acid sequence can be obtained. 
These may be useful for making better "cut sites" for protein linkers, or tor better protease 
substrates/inhibitors. In this embodiment, an immobilizable molecule (e.g. hGH-receptor, biotin-avidin, or one 
35 capable of covalent linkage with a matrix) is fused to gene ill through a linker. The linker will preferably be from 3 
to 10 amino acids in length and wll act as a substrate for a protease. A phagemto will be constructed as described 
above where the DNA encodrig the linker region is randomly mutated to produce a randomized Ibrary of phagemid 
part'des with different amino acid sequences at the linking site. The library of phagemid particles are then 
immobilized on a matrix and exposed to a desired protease. Phagemid particles having preferred or better 
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substrate amino acid sequences in the liner region for the desired protease wUI be eluted, Itfst producing an 
enriched pool of phagemid parlides encoding preferred linkers. These phagemid particles are then cycled several 
more times to produce an enrfched pool of partictes encoding consaise se(jjence{s) (see examples XIII and XIV). 

m Gimrth Homione Variant and Methods of Use 

5 The cJoned gene for hGH has been expressed in a secreted fonn in SdlfidCda ^ (Chang, C. N>, ef a/., 

[1987] iafii£5L.l89) and its DMA and amino acid sequence has been reported (Goeddel, etaL [1979] HalUffiM. 
544; Gray ef a/., [1985] Gene 39. 247). The present invention describes novel hGH variants produced using the 
phagemid selection methods. Human growth hormone variants containing substitutions atpositfons 10, 14, 18, 21, 
167, 171. 172, 174, 175. 176, 178 and 179 have been descrfoed. Those having higher binding affinities are 
1 G described in Tables VII. XIII and XIV. The amino acid nomenclature for describing the variants is shown below. 
Growth hormone variants may be atfminislered and fonnulated in the same manner as regular growth bomione. The 
growth homione variants of the present invention may be expressed in any recombinant system which is capable of 
expressing native or met hGH. 

Therapeutic fomiulations of hGH for therapeutic administration are prepared for storage by mixing 
1 5 hGH having the desired degree of purity with optional physiologically acceptable can-iers, exc'pents, or 

SiabiHzers (RpmirnjtQn's Pha rmarPiitinal Sciences. 16lh edition, Osol. A., Ed., (1980)., in the fomi of lyophilized 
cake or aqueous solutions. Acceptable carriers, excipients or stabiOzers are nontoxic to redplents at the dosages 
and concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants 
including ascorbic acid; low molecular weight (less than about 1 0 residues) polypeptides; proteins, such as serum 
20 albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutemine, asparagine, arglnine, or lysine; monosaccharides, disaccharides. and other carix)hydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; divalent metal ions such as zinc, cobalt or copper; 
sugar alcohols such as mannitoi or sorbitol; salt-fomiing counterions such as sodium; and^or nonionic surfactants 
such as Tween, Pluronics or polyethylene glycol (PEG). Formulations of the present invention may additionally 
25 contain a phamiaceutkally acceptable buffer, amino add. buMng agent and/or non4onfcsurbctanL Ttiese include, 
for example, buffers, chelating agents, antioxidants, presen/atives, cosolvents, and the liise; specific examples of 
these could incfude. trimethylamaine salts (Iris buffer"), and disodium edetate. The phagemids of the present 
Invention may be used to produce quantities of the hGH variants free of flie phage protein. To express hGH 
vartentsfree of the gene III portion of the fuston. pS0643 and derivatives can simply be grown m a non- 
30 suppressor strain such as 16C9. lnthiscase,the ambercodon(rA6) leads to termination of translation, which 
yields free hormone, without the need for an Independent DNA constmction. The hGH variant is secreted from Bie 
hostand may be isolated from the culture medium. 

One or moreof ttie eighthGHamino adds F10, M14, H18, H21, R167. D171, T175 and 1179 may be reped 
by any amino add other flian the one found in that position in naturally occurring hGH as indicated. Therefore, 1 , 2, 
35 3, 4, 5, 6, 7. or all 8 of the Indicated amino adds. F10, M14, H18, H21, R167, D171. T175 and II 79, may be replaced 
by any of the other 19 amino adds out of the 20 amino acids listed below. In a preferred embodiment, all eight 
listed amino adds are replaced by another amino add. The most preferred eight amino acids to be substituted are 
indicated in TaWe XIV in Example XII. 
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Airfeio add nomendabis. 

Ala (A) 
Arg (R) 

5 Asn (N) 

Asp (D) 
Cys (C) 
Gin (Q) 
Glu (E) 

10 Gly(G) 

His (H) 
He (I) 
Leu (L) 
Lys(K) 

15 Met(M) 

Phe (F) 
Pro (P) 
Ser (S) 
Thr (T) 

20 Trp (W) 

Tyr (Y) 
Val (V) 

The one letter hGH variant nomenclature first gives the hGH amino acid deleted, for eiemple glutamate 1 79; then 
the amino acid inserted; for example, serine; resulting in (E1795S). 

25 

EXAIMPLES 

Without further description, I is t)elieved that one of ordinary sidii in the art can, using the preceding 
description and lustrative examples, malce and utilize tte present invention to the fullest extent The following 
worldng scamples therefore specifically point out preferred emtxxliments of the present invention, and are not to 
30 be construed as limiting in any of the remainder of the disclosure. 

EXAJMPLE t 

Ptasmid Constniclions and Preparation of hGH-phagemid Particles 

The plasmid phGH-M13glll (Fig. 1), was constructed from M13K07^ and the hGH producing plasmid, 
pB0473 (Cunningham, B. C, etal. , Science. 243:1330-1336. [1989]). A synthetic oligonucleotide 5'-AGC- 
35 TGT-GGC-TTC- GGG-CCC -TTA-GCA-TTT-AAT-GCG-GTA-3' was used to introduce a unique Apa\ 

restriction site (underlined) into pB0473 after the final Phe191 codon of hGH. The oligonucleotide 5'-TTC- 
ACA-AAC-GAA- GGG-CCC-C TA-ATT-AAA-GCC-AGA-3' was used to introduce a unique Apa\ restriction site 
(underlined), and a Glu197-to-amt]er stop codon (tx)ld lettering) into M13K07 gene ill. The oligonucleotide 5'- 
CAA.TAA-TAA-CBG -GCT-AGC- CAA-AAG-AAC.TGQ-3' introduces a unique Alhei ^te (underlined) after the 
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3' end Of the gene III co*^ sequence. The resulting 650 base pair (bp) 4pai-Wtel fragment from the dM^ 
mutated M13K07 gene Bl was dbned Into the large Apai-Nha fragment of pB0473 to create the plasmid, 
PS0132. TOs fuses the cartwxyl terminus of hGH (PhelSI) to the Pio198 residue of the gene III protein with the 
insertion of a glyciie residue encoded from the Apa\ site and places the fusion protein under control of the £ coli 
5 ailaline phosphatase (p*»A) promoter and stil secretion signal sequence (C^ 

[19871). Fbr inducible expression of the fusion protein in rich media, we replaced the phoA promoter with the lac 
promoter and operator. A 138 bp £coRI-Xbal fragment containing the lac promoter, operator, and Cap binding 
site was produced by PGR of plasmid pUC1 19 using the digonudeotides 5'- 

CACGACASMIICCCGACTGGAAA-3' and 5'-CTGTT ICIASAGTGAAATTGTTA-3' that flank the desired 
1 0 lac sequences and introduce the EcoRI and Xbaf restriction sites (underiined); This lac fragment was gel purified 
and ligated into the large EcoRI-Xbal fifagmentof pS0132 to create the plasmid, phGH-M13gllL The sequences 
of all tailored DNA junctions were verified by the dideoxy sequence method (Sanger, F.. ef a/. Pffiff. Natl. tead. 
Sd. U S.A. 745463-5467, [1 977]). The R64A variant hGH phagemid was constajcled as follows: Ihe Nsil-Bglll 
mutated fiagment of hGH (Cunninghamef a/, supra ) encoding the Atg64 to Ala subsBtuUon (R64A) 
1 5 (Cunningham, B. C, WeBs. J. A.. Scifino. 244:1081-1085. J1989» wasdonedietween the corresponding 
restriction sites in the phGH-M13glll plasmid (Rg. 1) to replace the wild-type hOH sequence. The R64A hGH 
phagemid particles were propagated and titered as described below for the wild-type hGH-phagemid. 

Plasmids were transfonned trio a mate strain of £ co// (JM101) and setected on carbenidllin plates. A 
singte iransformant was grown in 2 ml 2YT medium for 4 h at src and infected with 50 nJ of M13K07 helper 
20 phage. The infected culture was diluted into 30 ml 2YT, grown overnight, and phagemid particles were harvested 
by predpitafion with polyethylene glycol (Vierra, J., Messing. J. .M^thmlsin FnmiOlPqY. 153:3-11. 11987]). 
Typical phagemid particle titers ranged from 2 to 5 x 10^^ cfu/ml. The particles were purified to homogeneity by 
CsCi density centrifugalion (Day, LA J MqI. Biol.. 39265-277, [1969]) to remove any fusion protein not 

atteched to virions. 
25 EXAMPLE H 

iiii M iodwitd/telysescfKaoniheRBfai Phage 
flabbit polyclonal anfibodfes to hGH were purified witti protein A, and coated onto microliter plates 
(Nunc) at a concentraflon of 2 pgftnl in 50 mM sodium carijonate buffer (pH 10) at 4'C for 16-20 fwurs. After 
washing in PBS containing 0.05% Tween 20. hGH or hGH-phagemid parlicteswere serially diluted from 2.0 - 
30 0.002 nM in buffer A(50 mM Tris (pH75), 50 mM NaCI, 2 mM EDTA,5mgrtnl bovine serum albumin, and 0.05% 
Tween 20). After 2 hours at room temperature (rt), the plates were washed weB and the indicated Mab 
(Cunninghamef af- s^xa ) was added at 1 \igM in bufter A for 2 hours at rt. Following washing, horseradish 
perondase corjigated goat anti-mouse IgG antibody was bound at rt for 1 hour. After a final wrash, the 
peroxidase sstivi^was assayed with the substrate, o-phenytenediamine. 
35 EXAMPl£ II 

toupling of the hGH Blndhg Protein to Polyacrylamlde Beads and HndIng Enrich 
Oxirane polyacrylamide beads (Sigma) were conjugated to flie purified extracellular domain of the hGH 
receptor (hGHbp) (Fuh. G., etal., J Biol. Chem. . 265:3111-3115 [1990]) containing an extra cysteine residue 
introduced by site-directed mutegenesis at position 237 that does not affect binding of hGH (J. Wells, 
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unpunished). The hGHbpwascor^ugated asrecommended by the ^^^^ 
oxiranebead.asmeasu,«lbyttnc*ngofl125pGHtothe,e8in.Subsequenlly.anyw^^ 

blocked v^th BSA and Tris. As a control tor non-specif fe binding of phagemid particles. BSA «a8 simitariy 
coupled to the beads. Buffer for adsorption and cashing contained l0mMTrisHCl(pH7.5). 1 mMEDTA.50mM 

5 Naa lmg/mlBSA.and0.02%T«een20. Elution buffers contained wash buffer plus 200 nMhGH or 0.2 M 
oly«ne(pH2.1). Pa^nlai phage M13K07 was n,l«d with hGH phagemid parfides at a ratio of nearty 3000^^ 
(onginalmixtu.«)andtum.^edfor8-12hw»ha5Mialk,uot(02n,gofacrylama^ 
50 ul «iun« at room temperature. The beads were pelleted by centrifugation and the supemate carefully 
removed The beads were ^suspended in 200 ul wash buffer and tumbled at room temperature for 4 hou.. 

10 (washi). Afterasecondwash{wash2).thebeadswereelutedtwicewith200nMhGHfor6-l0hourseach 
(eluatel eluate2). The final elution was with a glycine buffer (pH 2.1) for 4 hours to remove remaining hGH 
phagemii particles (eluate 3). Each fraction was diluted appropriately in 2YT med«. mixed with fresh JM101 . 
incubated at src for 5 minutes, and [^ated with 3 ml of 2YT soft agar on LB or LB carbeniciliin ptotes. 

EXAMPLE W 

15 construcUonofhGH-phageraidPartlcleswtthaMixtureofGenelllProd^ 

The gene 111 protein is composed of 410 residues divided into two domains that are separated by a 
,texiWeiinkersequence(Armstrong.J..efa/.. EEBiifitL 135:167-172, [1981]). The an,ino-tem,lnal domah is 
required for attachment to the pill of E ooli. whie the carboxyHerminal domain Is imbedded in the phage coat and 
requiredforproperphageassembly (Crissman. J. W.. Smith. G. P.. itoto 132:445-455.11984]). The signal 

20 sequenceandamino-terminddomaincfgenelllwasreplacedwMhthestllsig^ 

supra) byfusiontoresidue198inlhecarboxyi4erminaldomalnofgenelll(F«.1). ThehGH-genelllfu^onwas 
placed under conlrol of the lac promoter/operator in a plasmid (phGH-M13glll; l^g. 1) containing the pBR322 
p^ciamasegeneandColEI replication origin, and the phage f1 imergenfc region. The vector can be easily 
maintainedasasmallplasmid«ctorby8elecliononcarbenialn.w^^ 

25 for prepagation. Alternatively, the plasmid can be efBdently packaged into virions (calted phagemid particles) by 
infectton with helper phage such as M13K07 (Viera ef aL supra ) whfch avoWs problems of phage assembly. 
Phagemid infectlvity titers based upon transduction to carbeniciliin resistance in this system varied from 2-5 x 
101 Icolony lbm.ing units (cfuymL The titer of the M13K07 helper phage in these phagemkl stocks is -10 
plaque forming units (pfu)yml. 

3 0 W* this system we confirmed previous studies (Parmley. Smith SMP«; that homogeneous expresson of 

large proteins fused to gene 111 is deleterious to phage production (data not shown). For example, induction of the 
lac promoter in phGH-Ml3glll by addition of IPTG produced low phagemW titers. Moreover, phagemid particles 
produced by co-infectton with M13K07 containing an amber mutation in gene III gave ver^ 
«1010 cfu/ml). we believed that multiple copies of the gene III fusion attached to the phagemid surface could 

35 lead to mulfi^e point attachment (the -chelate effecf) of the fu^on phage to the ImmobSized target protein. 
Therefore to control the fuston protein copy number we limited transcription of the hGH-gene 111 fusion by 
oullurlng the plasmid In £ coJ JM101 (lac^) which contains a constitutively high level of the lac repressor proiem. 
The £ cor. JM101 cultures containing phGH-M13glll were best propagated ard Infected with M13K07 in the 
absence of the lacoperon Inducer (IPTG): however, this system isfiexibie so that co-expression of other gene ill 
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fusion proteins can be balanced. We estimate that about 1 0% of the phagemid particles contain one copy of the 
hGH gene 1(1 fusion protein from the ratio of the amount of hGH per virion (based on hGH Immuno-reactive materiai 
in CsCI gradient purifed phagemid). Therefore, the titer of fusion phage displaying the hGH gene 111 fusion is 
about 2 - 5 X t O^Oyml. This number is much greater than the titer of E coif (-108 to 1 0^/ml) in the culture from 
5 which they are derived. Thus, on average every £co// ceil produces 10-100 copies of phage decorated with an 
hGH gene III fusion protein, 

BCAMPLE V 
Structural Integrftyof the hGHijene H Fusion 
Immunoblot analysis (Rg. 2) of the hGH-gene lit phagemid shew that hGH cross-reactive material 
1 0 comigrates with phagemid particles in agarose gels. This indicales that the hGK is tightiy associated with 

phagemid particles. The hGH-gene III fusion protein from the phagemid particles runs as a single immuno-stalned 
band showing that there is little degradation of the hGH when It is attached to gene 111. Wild-type gene III protein 
IS cleariy present because about 25% of the phagemid particles are Infectious. This is comparable to specific 
infectivity estimates made for wild-type M13 phage that are sfmilariy purified (by CsCI density gradients) and 
1 5 concentrations estimated by UV absorbance (Smith, G. P. supra and Pamfitey, Smith supra) Thus, both wild-type 
gene IH and the hGH-gene ill fusion proteins are displayed in the phage pod. 

It was important to confinn that the tertiary structure of the displayed hGH was maintained in order to 
have confidence that results from binding selections will translate to the native protein. We used monoclonal 
antibodies (Mabs) to hGH to evaluate the structural integrity of the displayed hGH gene III fusion protein (Table 
20 I). 

TABLE L Binding of Bght Different Monoclonal 
Antibodies (Mab*^) to hGH and hGH PhagemU Particles* 

IC50 (nM) 

Mab hGH hGH-phagemid 
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1 0.4 0.4 

2 0.04 0.04 

3 0.2 0.2 

4 0.1 0.1 

5 0.2 >2.0 

6 0.07 0.2 

7 0.1 0.1 

8 0.f 0.1 

*Vaiues given represent the concentration (nM) of hGH or hGH-phagemid particles to give half-maximal binding to 
the particular Mab. Standard enrors in these measurements are typically at or below ±30% of the reported value 
See Materials and Methods for further details. 

30 Theepitopes on hGH for these Mabs have been mapped (Cunningham etaL supra) and binding for 7 of 

8 Mabs requires that hGH be proper^ folded. The IC50 values for all Mabs were equivalent to wild-^ hGH 
except for Mab 5 and 6. Both Mabs5and6areknowntohavebiMngdetemiinantsnearlhecartx)xyl-te 
hGH which is blocked in the gene III fusion proteia The relafive IC50 value for Mabi which reacts wilh both native 
and denatured hGH is unchanged compared to the conformationally sensitive Mabs 2-5, 7 and 8. Thus, Mab1 

35 serves as a good internal control for any errors in matching the concentration of the hGH standard to that of the 
hGH-gene III fuSon. 
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EXAMPLE VI 
Binding Enrichments on RecepUK AffM 

Previous workers (Parmley. Smith supra ; Scott Smith supra; Cwiria et al, supra: and Deviin etal. 
5 supra) have fractionated phage by panning with streptavidin coated polystyrene petri dishes or microtiter plates. 
However, chromatographic systems would allow more efficient fractionation of phagemid particles displaying 
mutant proteins with different t)inding affinities. We chose non-porous oxirane beads (Sigma) to avoid trapping 
of phagemid particles in the chromatographic resin. Furthennore, these beads have a small particle size (1 \m) to 
maximize the surface area to mass ratio. The extracellular domain of the hGH receptor (hGHbp) (Fuh etal. , 
1 0 supra) containing a free cysteine residue was efficiently coupled to these beads and phagemid particles showed 
very low non-specific binding to beads coupled only to bovine serum albumin (Table 11). 

TABLE II. 



15 Specific Binding of Hormone Phage to hGHbp-coated 

Beads Provides an Enrlcliment for tiGi4-pliage over IM13K07 Phage* 





Sample Absorbent^ 


Total pfu 


Total cf u 


Ratio (cfu/pfu) 


Enrichment§ 


20 


Original mixluret 


8.3 X 10"''' 


2.9 X 108 


3.5 X 10-4 


(1) 




Supernatant BSA 




2.8 X 108 


3.8 X 10-4 


1.1 




hGHbp 


7.6 X 101 ^ 


3.3 X 108 


4.3 X 10-4 


1.2 




Wash 1 BSA 


1.1 X lO^O 


6.0 X 106 


5.5 X 10-4 


1.6 




hGHbp 


l-QxIO^O 


1.7 X 107 


8.9 X 10-4 


2.5 


25 


Wash 2 BSA 


5.9 x 107 


2.8 X 10^ 


4.7 X 10-4 


1.3 




hGHbp 


4.9x 107 


2.7 X 106 


5.5 X 10-2 


1.6x102 




Eluate 1 (hGH)BSA 


1.1 x 106 


1.9X 103 


1.7 X 10-3 


4.9 




hGHbp 


1.2x10^ 


2.1 X 106 


1.8 


5.1 X 103 




Eluate 2 (hGH)BSA 


5.9 x 105 


1.2x 103 


2.0 X 10-3 


5.7 


30 


hGHbp 


5.5x105 


1.3x 106 


2.4 


6.9x103 




Eluate 3 (pH 2.1)BSA 


4.6 X 105 


2.0 X 103 


4.3 X 10-3 


12.3 




hGHbp 


3.8 x 10^ 


4.0 X 10^ 


10.5 


3.0 X 104 



*The titers of M13K07 and hGH-phagemid particles In each fraction was deternnined 
35 by multiplying the number of plaque forming units (pfu) or carbenicillln resistant 
colony forming units (cfu) by the dilution factor, respectively. See Example IV for 
details. 

tThe ratio of M13K07 to hGH-phagemid particles was adjusted to 3000:1 In the 
original mixture. 
40 t Absorbents were conjugated with BSA or hGHbp. 

§Enrichments are calculated by dividing the cfu/pfu ratio after each step by cfu/pfu 
ratio in the original mixture. 

in a typical enrichment experiment (Table 11), one part of hGH phagemid was mixed with >3,000 parts 
M13K07 phage. After one cycle of binding and elution, 10^ phage were recovered and the ratio of phagemid to 
45 M13K07 phage was 2 to 1 . Thus, a single binding selection step gave >5000-fold enrichment. Additional elutions 
with free hGH or acid treatment to remove remaining phagemids produced even greater enrichments. The 
enrichments are comparable to those obtained by Smith and coworl«rs using batch elulion from coated 
polystyrene plates (Smith, G.P. supra and Pannely, Smith scjpra ) however much smaller volumes are used on the 
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beads(200filvs.6ml). TherewasalmostnoenrichmentlbrthehGHphag8irtdowBrMl3K07v*^ 
beads linked only to BSA. The slight enrichmentobservedforwnttol beads (-10-fold for pH 2.1 elutionr Table 
2) may result from trace contaminarts of bovine growth hormone binding protein present in the BSA linked to the 
bead. Neverflieless these data show the enrichments for the hGH phage depend upon the presence of the hGHfap 
5 on the bead suggesting binding occurs by specific interactfen between hGH and the hGHbp. 

We evaluated the enrichment for wikl-^ hGH over a weaker binding variant of the hGH on fusion 
phagemids to further demonstrate enrichment spedlictty. and to link the reduction in binding affinity for the 
purified homiones to enrichment factors after panning fusion phagemids. A fusion phagemid was constructed 
with an hGH mutant in which Arg64 was subslitoted with Ala (R64A). The R64A variant hormone is about 20- 
1 0 fokl reduced in receptor binding affinity compared to hGH (Kd values of 7.1 nM and 054 nM, respectively 

[Cunningham. Wells, supra]). The titers of Ihe R64AhGH-gene III fusfon phageraW were comparable to those of 
wild-type hGH phagemid. After one round of binding and elutfon (Table 110 tie wild-type hGH phagemid was 
enriched from a mixture of the two phagemkis plus M13K07 by B-foW relative to the phagemW RMA. and -10* 
relative to M13K07 helper phage. 



15 



20 



TABLElfl. hGHb|>coatedBeadsSelectforhGH Phagemids 
Over a Weaker Btading hGH Variant Phagemid 

r.nntmi heads . — hGH t JPbeads 



sample wn-nhanemid enrichment vyTphagemjd f'SjJ^S.A 

total phagemid forWr/R64A total phagemri forWr/R64A 

Original mixture 8^20 W fjfO ~W 

25 Supernatant ND - ^/lO i.o 

Efutionl(hGH) 7/20 0.8 17/20 8.5? 

Elution2(pH2.1) It/20 1.8 ZVZ? 5.2 

•The parent M13K07 phage, wild-type hGH phagemW and R64A phagemid partictes were mixed at a ratio of 
30 10*:0 4« 6 Binding selectfons were carried out using beads linked with BSA (control beads) or with the hGHbp 
(hGHto beads) as described in Tabte II and the Materials and Methods After each s^^^ . 
isolated(Bimb<im. H. C. Doly. J. . Nimlfffe Acids Res.. 7:1513-1523. 11979]) ««;!L«art«niaffl^^^^ 
and analyzed by restrictfon analysis to detennine if it contained the wld-lype hGH or the R64A hGH gene III 

fiision. . .u«.. u -J 

35 tFhe enrichment for wild-type hGH phagemid over R64A mutant was cateulated from the ratio of hGH pha^mid 
present after each step to that present in the original mixtore (8/20), divided by the corresponding ratio for 
R64A phagemMs. WT = wild-type; ND = not deteraiined. 

tThe enrichment for frfagemW over total M13K07 parentel pnsm was -1 0* after fiis step. 

Condusians 

40 By displaymga mixture of wild-type gene III and the gene 111 fusfon protein on phagemkl partfcles one can 

assemble and prop^e virions that display a large and proper foWed protein as a fusion to gene III. The copy 
number of the gene 111 fusfon protein can be effectively controlted to avoid "chelate effects" yet maintained at high 
enough levels in the phagemid pool to pemiit panning of large epitope Hbraries (>1 qIO). We have shown that hGH 
(a 22 kD protein) can be displayed in its native folded fomi. Binding selectfons performed on receptor affinity 

45 beads eluted with free hGH, efficiently enriched tor wild-type hGH phagemids over a mutent hGH phagemkl shown 
to have reduced receptor binding affini^. Thus, it is possible to sort phagemid partfcles whose binding constants 
are down n the nanomolar range. 
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Protein-protein and antibody-antigen interactions are dominated by discontinuous epitopes (Janin, J., 

e/a/., J. Mol. Biol.. 2M:155-164,[1988]; Argos, P,,Ei2L£Q1, 2:10M 13, [1988]; Barlow, DJ.,e/ a/. , NatUffi, 

322:747-748, [1987]; and Davies, D.R., et ai , J. Biol. Chem.. 263:10541-10544. [1988]); that is the residues 

directly involved In binding are close in tertiary structure but separated by residues not involved in binding. The 

5 screening system presented here should allow one to analyze more conveniently protein-receptor Interactions and 

isolate discontinuous epitopes in proteins with new and high affinity binding properties. 

EXAMPLE VII 
Selection Of hGH Mutarits from a Library Ra^ 
hGHC0dons172,174,176,178 

10 ConstHJction of template 

A mutant of the hGH-gene III fusion protein was constructed using the metfxxi of KunkeL,e/ a/. Meth. 
Enzvmol. 154. 367-382 [1987]. Template DNA was prepared by growing the plasmid pS0132 (containing the 
natural hGH gene fused to the carboxy-terminal half of M13 gene III, under control of the alkaline phosphatase 
promoter) in CJ236 cells with M13-K07 phage added as helper. Single-stranded, uracil-containing DNA was 

1 5 prepared for mutagenesis to introduce (1 ) a mutation in hGH which would greatly reduce binding to the hGH 
binding protein (hGHbp); and (2) a unique restriction site (Kpnl) which could be used for assaying for - and 
selecting against - parental background phage. Oligonucleotide-directed mutagenesis was can'ied out using T7 
DNA polymerase and the following digodeoxy-nucleotide: 

Gly Thr 

20 hGH codon: 178 179 

5"-G ACA TTC CTG fiGX A£.C GTG CAG 

< Kpnl > 

This oligo introduces the Kpnl site as shown, along with mutations (R178G. I179T) in hGH. These mutations are 

predicted to reduce binding of hGH to hGHbp by more than 30-fold. Clones from the mutagenesis were screened 
25 by Kpnl digestion and confirmed by dideoxy DNA sequencing. The resulting construct, to be used as a template 

for random mutagenesis, was designated pH0415. 

Random mutagenesis wfthin helix-4 of hGH 

Codons 172, 174, 176, 178 were targeted for random mutagenesis in hGH, again using the method of 

Kunkel. Single-stranded template from pH0415 was prepared as above and mutagenesis was carried out using 
30 the foltowing pool of oligos: 

hGH codon: 172 174 

5'- GC TTC AGG AAG GAC ATG GAC MNS. GTC NNS ACA- 

35 He 
176 178 179 

- NNS CTG NNS AXC GTG CAG TGC CGC TCT GTG G-3 » 

As shown, this oligo pool reverts codon 179 to wild-type (lie), destroys the unique Kpnl site of pH0415, and 
introduces random codons (NNS, where N= ^G,C, or T and S= G or C) at posiltons 172,174,176, and 178. Using 

4 0 this codon selection in the context of the above sequence, no additional Kpnl sites can be created. The choice of the 
NNS degenerate sequence yields 32 possible codons (including one 'stop' codon, and at least one codon for each 
amino acid) at 4 sites, for a total of (32)"^= 1,048,576 possible nucleotide sequences (12% of which contain at 
least one stop codon), or (20)^= 160,000 possible polypeptide sequences plus 34,481 prematurely terminated 
sequences (i.& sequences containing at least one stop codon). 

45 Propagation of the Initial library 
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The mutagenesis producls were extracted twice with phenolxhloroform (50S0) and ethanol 
predpitafcd with an excess of carrier tRNA to avoid adding salt that would confound the subsequent 
etectroporationstep. /^proximalelySOng (15fmols)of DNA waselectroporat^ cells (2.8 x 10^0 

ceifsmiL) in 45 jiL total volume in a 0.2 cm cuvette at a voltage setting of 2.49 kV with a single pulse (time 

5 constant -4.7 msec.}. 

The cells vwre allowed to recover 1 houratSTOG with shaking, then mixed with 25 mL2YT medium. 100 
\xglmL cartenidllin, and M13-K07 (multiplicity of Infection = 1000). Plating of serial dilutions from this culture 
onto cartwnidllin-containing media indicated that 8.2 x 10^ electrotransformants were obtained. After 10' at 
23^0, the culture was incubated overnight (15 hours) at 370C with shaking. 

1 0 After overnight incubation, the cells were pelleted, and double-slranded DNA (dsDNA), deSgnaled 

pLIBI , was prepared by the alkaline lysis method. The supematantwas spun again to remove any remaining cells, 
and the phage, designated phage pool (t>1 . were PEG-preapitated and resuspended in 1 mL STE buffer (10 mM 
Tris, pH 7.6, 1 mM EDTA, 50 mM NaCI). Phage titers were measured as colony-fanning units (CFU) for the 
recombinant phagemid containing hGH-gSp gene III fusion (hGH-g3) plasmid, and plaque-fonning units (PFU) far 

15 the M13-K07 helper phage. 

Bf ndlng seieetkin usTng HimMfaed hGHfao 

1. BINDING: An aliquot of phage pool 4»1 (6 x 10^ CFU, 6 x 10^ PFU) was diluted 4.5-fold in buffer A 
(Phosphate-buffered saline. 05% BSA, 0D5% Tween-20, 0.01% thimerosal) and mixed virith a 5 ^iL suspension of 
oxirane-poIyacr)tonide beads coupled to the hGHbp containing a Ser237 Cys mutation (350 fmols) in a 1.5 mL 

20 silated polypropylene tube. As a control, an equivalent aliquot of phage were mixed in a separate tube with beads 
that had been coated with BSA only. The phage were allowed to bind to the beads by incubating 3 hours at room 
temperature (23^C) with slow rotation (approximately 7 RPM). Subsequent steps were carried out with a 
constant volume of 200^iL and at room temperature. 

2. WASH: The beads were spun 15 sec., and the supernatant was removed (Sup. 1). To remove 

25 phageAphagemid not spedRcally bound, the beads were washed twice by resuspending in buffer A, then pelfeting. 
A final wash consisted of rotating the beads in buffer A for 2 hours. 

3. hGH ELUTION: Phage/|3hagemid binding weakly to the beads were removed by stepwise elution vnth 
hGH. In the first step, the beads were rotated with buffer Aconlaining 2 nM hGH. After 17 hours , the beads 
were pelleted and resuspended in buffer A containing 20 nM hGH and rotated for 3 hours, then pelfeted. In the 

3 0 final hGH wash, the beads were suspended in buffer A containing 200 nM hGH and rotated for 3 hours then 



4. GLYCINE ELUTION: To remove the tightest-binding phagemid (i.e. those still bound after the hGH 
washes), beads were suspended in Glycine buffer (1 M.Glycine, pH 2.0 witii HCI), rotated 2 hours and pelleted. 
The supernatant (fraction 'G'; 200jiL) was neutralized by adding 30 \± of 1 M Tris base. 
35 Fraction G eluted from the hGHbp-beads (1 x 10^ CFU. 5x10^ PFU) was not substentiaily enriched 

for phagemid over K07 helper phage. We belfeve this resulted from the fact that K07 phage packaged during 
propagation of the recombinant phagemid display ttie hGH-g3p fusion. 
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However, when compared with fraction G eluted from the BSA<x)ated control beads, the hGHbp-beads 
yielded 14 times as many CPU's. This reflects the enrichment of tight-binding hGH-displaying phagemid over 
nonspedficaily-binding phagemid. 

5. PROPAGATION: An aliquot (4.3 x 10^ CPU) of fraction G eluted from the hGHbp-beads was used 
5 to Infect log-phase WJM101 cells. Transducttons were carried out by mixing 100 fraction G with 1 mL WJM101 
celts, incubating 20 min. at 37<>C, then adding K07 (multiplicity of infection^ 1000). Cultures (25 mL 2YT plus 
carbeniclHtn) were grown as described above and the second pool of phage (Library 1G, for first glycine elution) 
were prepared as described above. 

Phage from library 1 G (Rg. 3) were selected for binding to hGHbp beads as described above. Fraction 
1 0 G eluted from hGHbp beads contained 30 times as many CPU's as fraction G eluted from BSA-beads in this 
selection. Again, an aliquot of fraction G was propagated in WJM101 cells to yield library tG^ (indicating that 
this library had been twice selected by glycine elution). Double-stranded DNA (pLIB 1G^) was also prepared 
from this culture. 

Kpnl dsm ml rpaiifftfon-setecttgn of dsPNA 
15 To reduce the level of background (Kpni*^) template, an aliquot (about 0.5 ^g) of pLIB 1G^ was 

digested with Kpnl and electroporated into WJM101 cells. These cells were grown In the presence of K07 
(multiplicity of infection^:: 100) as described for the initial library, and a new phage pool, pLIB 3, was prepared 

(Rg. 3). 

In addition, an aliquot (about 0.5 ^g) of dsDNA from the initial library (pLIBI) was digested with Kpnl 
20 and electroporated directly into WJM101 cells. Transfomnants were allowed to recover as above, infected with 
M13-K07, and grown overnight to obtain a new library of phage, designated phage Library 2 (Rg. 3). 

Successh/e rounds Of seleciiQD 

Phagemid binding, elution, and propagation were canned out in successive rounds for phagemid derived 
from both pLIB 2 and pLIB 3 (Rg. 3) as described above, except that (1) an excess (10-fold over CPU) of 
25 purified K07 phage (not displaying hGH) was added in the bead-binding cocktail, and (2) the hGH stepwise 
eludons were replaced with brief washings of buffer A alone. Also, in some cases, XL1 -Blue cells were used for 
phagemkf propagation. 

An additional digestion of dsDN A with Kpnl was carried out on pLIB 2G? and on pLIB before the 
final round of bead-binding selection (Rg. 3). 
30 DNA Sequencing off selected phaoeinids 

Pour independently isolated clones from LIB 4G^ and four independently isolated ck)nes from LIB SG^ 
were sequenced by dideoxy sequencing. All eight of these ctones had kjentical DNA sequences: 

hGH codon: 172 174 176 178 

35 5' -AAG GTC TCC ACA TAG CTG AGG ATC-3' 

Thus, all these encode the same mutant of hGH: (El 74S, P1 76Y). Residue 1 72 in these clones is Lys as in wild- 
type. The codon selected for 172 is also Wentical to wild-type hGH. This is not surprising since AAG is the only 
lysine-codon possible from a degenerate "NNS" codon seL Residue 178-Arg is also the same as wild-type, but 
here, the codon selected from the library was AAG Instead of CGC as is found in wild-type hGH, even though the 
4 0 latter codon is also possible using the "NNS" codon set. 
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|ympJirHvafKfl7 Infection 

The rautt^idty of infection of K07 infection is an important parameter in the propagation of 
recombinant phagemlds. The K07 multipBcity of Infection must be high enough to insure that virtuaOy all cells 
ftansfonned ortransfected with phagemid are al)ie to package newphagemid particles. Furthermore, the 
5 concentration of wiid-ftrpe gene ill in each cell should be kept high to reduce the possil)ility of mult^e hGH-gene III 
fusion molecules being displayed on each phagemid partide, thereby reducing chelate effects in binding. However, 
if the K07 mulBplidty of infection is too high, the packaging of K07 will compete with ttiat of recombinant 
phagemid. We find that acceptabte phagemid yields, with only 1 -10% background K07 phage, are obtained when 
the K07 multipricity of infection is 100. 

° TabielV. 



Phage Pool 


moi (K07} 


Enrichment 
CFU/PFU 


hGHbp/BSA beads 


Fraction Kpnl 


LIB1 
LIB1G 
LIBS 
LIB 3G3 
LIB 3G* 
UB5 


1000 

1000 

100 

10 

100 

100 


ND 

ND 

ND 

ND 

460 

ND 


14 

30 

1.7 

8.S 

220 

15 


0.44 

0.57 

0.26 

0.18 

0.13 

ND 


LIB 2 
LIB2G 
LIB 2G2 
LIB 4 


100 
10 
100 
100 


ND 
ND 
1000 
170 


1.7 
4.1 
27 
38 


<0.05 
<0.10 
0.18 
NO 



Phage pools are labelled as shown (Rg. 3). The multiplicity of infecfion (moi) refers to the multiplicl^ of K07 
fnfecliorv (PFU/cells) in the propagation of phagemid. The enrichmentof CPU over PFU is shown in those cases 

1 5 where purtfied K07 was added in the binding step. The ratio of CPU eluling from hGHbp^)eads over CPU eiufing 
from BSA-beads is shown. The fraction of Kpnl^nlaining template (i.e.. pH04t5) remaining in the poof was 
determined by digesfing dsDNA with Kpnl plus EcoRI, running the products on a 1% agarose gel, and laser- 
scanning a negative of the ethidium bromide-stained DMA. 
qpce ptoNblnding affinity Hf the hQfmonehGH/E174S. m&h 

ZO The fact that a single done was isolated from two different pathways of selection (Fg. 3) suggested 

that the double mutant {E174S,F176Y) binds strongly to hGHbp. To determine the affinity of this mutant of 
hGH for hGHbp, we constructed this mutant of hGH by site-directed mutagenesis, using a plasmid {pB0720} 
which contains the wiW-lype hGH gene as template and the following oligonucleotide which changes codons 1 74 and 
176: 

25 hGH codon: 172 174 176 178 

Lys Ser Tyr Arg 

5^- ATG GAC AAG GT£ ICG ACA T^C CTG CGC ATC GTG -3' 
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The resulting construct, pH0458B, was transfbmied into E. coli strain 1 6C9 for expression of the mutant 
hormone. Scatchard anafysis of competifive binding of hGH(E174S,P176Y) versus ^^\-hGH to hGHbp indicated 
that the (E174S.F176Y) mutant has a binding affinity at least 5.0-fbld tighter than that of wild-type hGH. 
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EXAMPLE VIII 

SELECTION OF hGH VARIANTTS FROM A 
5 HEUX-4 RANDOM CASSEHE UBRARY OF HORMONE-PHAGE 

Human growth hormone variants were produced by the method of the present invention using the 

phagemid described in figure 9. 

Congrtnjrttonofadfrfi^ 

We designed a vector for cassette mutagenesis (Wells et al., £fiD& 34, 315-323 [1985]) and expression 

10 of the hGH-gene III fusion prote^ with the objectives of (1 ) improving the linkage between hGH and the gene III 
moiety to more bvorably display the hGH moiety on the phage (2) limiting expression of the fusion protein to 
obtain essentially "monovalent display," (3) allowing for restriction nuclease selection against the starting vector, 
(4) eliminating expression of fusion protein from the starting vector, and (5) achieving facile expression of the 
corresponding free honmone from a given hGH-gene III fasion mutant. 

1 5 Piasmid pS0643 was constnjcted by oligonucleotide-directed mutagenesis (Kunkel et al., Alfitbod^ 

Enzymol. 154, 367-382 [1987]) Of pS0132, whk:h contains pBR322 and f1 origins of replication and expresses an 
hGH-gene III fusion protein (hGH residues 1-191, followed by a single Gly resklue, fused to Pro-198 of gene III) 
under the control of the E. coll p hoA promoter (Bass et al., Proteins 8, 309-314 [1990]){Figure 9). Mutagenesis 
was carried out with the oligonucleotide 5'-GGC-AGCTGT-GGC-T TC-TAG-AG T-GGC-GGC-GGC-TCT> 

20 GGT-3\ which introduces a Xbai site (underlined) and an amber stop codon (TAG) following Phe-191 of hGH. tn 
the resulting construct, pS0643, a portion of gene III was deleted, and two silent mutations (underiined) 
occurred, yielding the foltowing junctton between hGH and gene III: 

— liGB > genem ^> 

25 187 188 189 190 191 am* 249 250 251 252 253 254 

Ser Qys G^r Fbe Gfai Ser Gfy Qy Cfy Ser CSy 

GGC MSC TGT GGA TIC TAG AGT CCQ GOT GGC TCT GGT 

This shortens the total size of the fusion protein from 401 resUues in pS0132 to 350 residues in 
30 pS0643. Experiments using monoclonal antibodies against hGH have demonstrated that the hGH portion of the 
new fusion protein, assembled on a phage particle, is more accessible than was the previous, longer fusion. 

For propagation of hormone-displaying phage, pS0643 and derivatives can be grown in a amber- 
suppressor strain of E. coll. such as JM101 or XLI-Blue (Bullock et al., BioTechniques 5, 376-379 [1987]). Shown 
above is substitution of Glu at the amber codon which occurs in ^uqE suppressor strains. Suppression with other 
35 amino acids is also possible in various available strains of LM well known and pubik:ally available. 

To express hGH (or mutants) free of the gene 111 portion of the fusion, pS0643 and derivatives can 
simply be grown in a non-suppressor strain such as 16C9. In this case, the amber codon (TAG) leads to 
temoination of translation, whkdi yieUs firee hormone, without the need for an independent DNA constmctton. 

To create sites for cassette mutagenesis, pS0643 was mutated with the oligonucleotides (1) 5-CGG- 
40 ACT-GGG-CA£Lft]MTC-AAG-CAG-ACC-3\ which destroys the unique fiflllt site of pS0643; (2) 5'-CTC- 
AAG-AAC-TAC-GfiMIAdC£C-TGA-CTG-Cn-CAG-GAA-GG-3', which inserts a unique fislEll site, a 
single-base frameshift, and a non-amber stop codon (TGA); and (3) 5'-CGC-ATC-GTG-CAG-TGC -AGA-TCT - 
GTG-GAG-GGC-3', which introduces a new Bglli site, to yield the starting vector, pH0509. The addition of a 
frameshift along with a TGA stop codon insures that no genelll-fusion can be produced from the starting vector. 
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ThftBstEII- Bqiii senment is cut out of pH0509 and replaced with a DNA cassette, mutated at the codons of 
interest Other restriction sites for cassette mutagenesis at other locations in hGH Have also been introduced into 
fie hormone^JlUQevector. 

5 Codons 172. 174. 17eand 178 of hGH were targeted for random mutagenesis lecause they all lie on or 

near the surface of hGH and conlrilxite significantly to receptor-binding (Cunningham and Wells. SEieD£fi244. 
1081-1085 [19891); they alt ie within a weil-deffined structure, occupying 2 turns" on the same side oi hefix 4; 
- andthey are each substilutedbyat least one amino add among known ewlutionaryvar^ 

We chose to substitute NNS {N=A/6/C/r: S=G/C) at each of the target residues. The choice of the 
1 0 NNS degenerate sequence yields 32 possible codons (including at least one codon foreach amino add) at4 sites, 
for a total of (32)4= i,048.576 possible nucleotide sequences, or (20)*= 160.000 possible polypeptide 
sequences. Only one slop codon. amber (TAG), is allowBd by this choice of codons. and this codon is suppressible 

as QIu in supE strains of E- coli. 

Two degenerate oligonudeotides, with NNS at codons 172, 174, 176. and 178, were synthesized. 
1 5 phosphoryfated, and annealed to construct the mutagenic cassetlB: ff-GT-TAC-TCT-ACT-GCT-TTC-AGG- 
AAG-6AC-AT6-GAG-NNS-GTC-NNS-ACA-NNS-CTQ-NNS-ATC-GTG-CAG-TGG-A-3'. and 5'-GA-TGT- 
GCA-CTG-CAC-GAT-SNN-CAG-SNM-TGT-SNN-GAC-SNN-6TC-CAT-GTC-CTT-CCT-GAA-GCA-GTA- 

6A-3'. 

The vector was prepared by digesling pH0509 witft fistEll followed by figUi. The products were run on 
20 a t% agarose gel and the large fragment excised, phenol-extracted, and ethanol precipitated. This fragment was 
treated with calf intestinal phosphatase (Boehringer), then phenolxhtorofomi extracted, ethanol precipitated, 
and resuspended for ligation with the mutagenic cassette. 

pf^fpaqatinn nf the InMqnU^rafy fti yH-Blue cells 

Following ligaton. the reacfion products were again digested with fislEll. then phenolshtorofonn 
25 extracted, ethanol precipitated and resuspended in water. (A fislElI recognition site (GGTNACC) is created 
within cassettes which contain a fi at position 3 of codon 172 and an ACS (Thr) codon at 174. However. 
trealmentwittifislEllat this step should not select against anyof the possible mutagenic cassettes, because 
virtually an cassettes will be heteroduplexes. which cannot be cfeaved by the en^e.) Approximately 150 ng (45 
fmols) of DNA was eiectropotaled Info XLl-Blue cells (1.8 x 10^ cells in 0.045 ml) ina 0.2 cm cuvette at a 
30 voltage setting of 2.49 l<V with a angle pulse (fime constant = 4.7 msec). 

The cells were albwed to recover l hour at 370C in S.0£ media with shalang, then mixed with 25 mL 
2YT medium. 100 mg/mLcarbenicillin. and M13-K07 (moi* 100). After 10' at 230C. the cuBuie was incubated 
overnight (15 hours) at 370C witti shaking. Rating of serial dilutions from this culture onto carbenicillin- 
containing media Indicated that 35 x 10^ elecltotransfonnants were obtained. 
35 After overnight incubation, the eels were pelleted, and double-stranded DNA (dsDNA), designated 

pTO529E (the iralial library), was prepared by the alkaline lysis method. The supernatant was spun again to 
remove any remaining cells, and the phage, designated phage pool (1.H0529E (the initial library of phage), were 
PEG-predpitaled and resuspended in 1 ml STE buffer (1 0 mM Tris. pH 7.6. 1 ml^ EDTA, 50 mM NaCI). Phage 
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titers were measured as coiony-forming units (CPU) for the recombinant phagemid containing hGH-g3p. 
Approximately 4.5 x 10^3 CPU were obtained from the starting library. 
Degeneracy of the starting library 

Prom the pool of electrotransformants, 58 clones were sequenced in the region of the fist£ll-£glli 
5 cassette. Of these. 1 7% corresponded to the starting vector, 1 7% contained at least one frame shift, and 7% 
contained a non-silent (non-terminafing) mutation outside the four target codons. We conclude that 41% of the 
clones were defective by one of the above measures, leaving a total functional pool of 2.0 x 10^ Initial 
transformants. This number still exceeds the possible number of DNA sequences by nearly 20-fold. Therefore, 
we are confident of having all possible sequences represented in the starting library. 
1 0 We examined the sequences of non-selected phage to evaluate the degree of oodon bias in the 

mutagenesis (Table V). The results Indicated that, although some codons (and amino acids) are under- or over- 
represented relative to the random expectation, the library Is extremely diverse, with no evidence of large-scale 
'sibling" degeneracy (Table VI). 

Table V. 

15 

Codon distribution (per 188 codons) of non-selected honnone phage. Clones were sequenced from the starting 
library (pH0529E). All codons were tabulated, including those from clones which contained spurious mutations 
and/or frameshifts. * Note: the amber stop codon (TAG) is suppressed as Glu in XL1-Blue cells. HIghiighted 
codons were over/under-represented by 50% or more. 

20 





Residue 


Number exoected 


Nunfeer found 


Poundi'ExKcted 




Leu 


17j6 


18 


1.0 


25 


Ser 


17j6 


26 


1.5 




Arg 


17.6 


10 


0.57 




Pro 


11.8 


16 


1.4 




Thr 


11.8 


14 


1.2 


30 


Ala 


11.8 


13 


1.1 




Gl/ 


11.8 


16 


1.4 




Val 


1U 


4 


0.3 




le 


5.9 


2 


0.3 


35 


Met 


5.9 


1 


0.2 




Tyr 


5.9 


1 


0.2 




His 


5.9 


2 


0.3 




Tip 


5.9 


2 


0.3 




Rie 


5.9 


5 


0.9 


40 


Cys 


5.9 


5 


0.9 




Gin 


5.9 


7 


1.2 




tan 


5.9 


14 


2.4 




Lys 


5.9 


11 


1.9 




Asp 


5.9 


9 


1.5 


45 


Glu 


5.9 


6 


1.0 




amber* 


5.9 


6 


1.0 
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Table VI. 

5 asGluinXLl-Bluecellsareshownase. 



20 



KeNT 
TWGS 
PeER 

1 0 LPPS 
SLOP 
QQSN 
GSKT 
TPVT 

15 RSRA 
LCGL 
TGRL 
AKAS 
GNDD 



KTcU 


CVLO 


NNCR 


EASL 


FPCL 


SSKE 


NSDF 


ALLL 


HRPS 


PSHP 


LSU 


SYAP 


NGSK 


ASNG 


LTTE 


EANN 


PSGG 


KNAK 


LWFP 


SRGK 


PAGS 


GLDG 


GRAK 


NOP! 


GTNG 





Pfft^ faflon of liTimob» iiy?^t hfiHbo and hPRUx) 

Immobinzed hGHbp ChGHbp-beads') was prepared as described (Bass et al., EflileillsS, 309-314 
ti990]), except that wiW-type hGHbp (Fuh et al., J. Biol. Chem. 265. 3111-3115 [1990]) was used. Competitive 
25 binding expenments with 0 hGH indicated that 58 Imols of fundional hGHbp were coupled per jiL of bead 
suspension. 

Immobilized hPRLbp fhPRLbp^BadS") was prepared as above, using the 21 1-residue extracellular 
domain of the prolactin receptor (Cunningham et al.. SfiiSDCS 250, 1709-1712 [1990)). Competitive binding 
experimBntswith [^25pQH in the presence of 50 mM zinc indicated that 2.1 fmolsof funcBonalhPRLbpwere 
30 couf^ per^ of bead suspension. 

■Blaik beads" virete prepared by treating the oxtrane-aaylamide beads with 0.6 M ethanolamine (pH 

9.2) for 15 hours at 40C. 
ffipftn^yafedtentiiJnglmnyiMla^ 

BftKfing of hormone-phage to beads was carried out in one of the following buffers: Buffer A (PBS, 

35 0.5% BSA, 0.05% Tvwen 20, 0.01% thimerosal) for selections using hGHl^ and Wank beads; Buffer B (50 mM 
trispH 7.5. 10 mM MgCi2. 0.5% BSA. 0.05% Tween 20. 100 mMZnCIa) for selections using hPRLbp in the 
presence of zinc (+ Zn2+); or Buffer C (PBS, 0.5% BSA, 0.05% Tween 20. 0.01% thimerosal. 10 mM EDTA) for 
selections using hPRlJ)p in the absence of zinc (+ EDTA). Binding selections were carried out according to each 
of the following paths: (1) binding to blank beads, (2) binding to hGHbp4»eads, (3) bhding to hPRll)p-beads (+ 

40 Zn2+), (4) binding to hPRLbp-beads (+ EDTA), (5) pre-adsorbing twice with hGHbp beads then binding the 
non-adsod)ed firaclion to hPRLbp-beads 

C-hGHbp, +hPRLbp' selection), or (6) pre-adsorbing twice with hPRLbp-beads then binding fiie non-adsorbed 
fracfion ID hGHbp^)eads ('-hPRLbp. -i-hGHbp- selection). The latter two procedures are expected to enrich 
for mutants binding hPRLbp but not hGHbp. or for mutants binding hGHbp but nothPRLbp. respectively. 
45 BihdingandelutionofphagBwascarriedoutineachcycleasfollows: 

l.BINDING: Analk?uotof homionephage (typically 10^ -10^0 CFU) was mixed with an equal amount of 
non-hormone phage (pCAT). diluted into the appropriate buffer (A. B. or C). and mixed with a 10 mL suspension 



wo 92/09690 



35 



PCT/US91/09133 



of hGHbp, hPRLbp or blank beads In a total volume of 200mL in a 1 .5 mL polypropylene tube. The phage were 
allowed to bind to the beads by incubating 1 hour at room temperature (23^0) with slow rotation (approximately 
7 RPM). Subsequent steps were carried out with a constant volume of 200)iL and at room temperature. 

2. WASHES: The beads were spun 15 sec, and the supernatant was removed. To reduce the number of 
5 phage not specifically bound, the beads were washed 5 times by resuspending briefly in the appropriate buffer, 

then pelleting. 

3. hGH ELUTION: Phage binding weakly to the beads were removed by elutk)n with hGH. The beads 
were rotated with the appropriate buffer containing 400 rM.hGH for 15-17 hours. The supernatant was saved as 
the 'hGH elution' and the beads. The beads were washed by resuspending briefly In buffer and pelleting. 

1 0 4. GLYCINE ELUTION: To remove the tightest-binding phage (Le. those stiH bound after the hGH 

wash), beads were suspended in Glycine buffer (Buffer A plus 0.2 M.6iycine, pH 2.0 with HCI), rotated 1 hour and 
pelleted. The supernatant ("Glycine elution'; 200(xL) was neutralized by adding 30 ml of 1 M Tris base and 
stored at 4^ C. 

5. PROPAGATION: Allquots from the hGH elutions and from the Glycine elutions from each set of 
1 5 beads under each set of condittons were used to infect separate cultures of tog-phase XLI -Blue cells. 

Transductions were carried out by mixing phage with 1 ml XLI -Blue cells, incubating 20 min. at 37^C, then adding 
K07 (moi= 100). Cultures (25 mL 2YT plus carbenfeillin) were grown as described above and the next pool of 
phage was prepared as described above. 

Phage binding, eiutton, and propagation were carried out in successive rounds, according to the cycle 
2 0 described above. For example, the phage amplified from the hGH elutton from hGHbp-beads were again selected 
on hGHbp-beads and eluted with hGH, then used to infect a new culture of XLI -Blue cells. Three to five rounds of 
selectton and propagatton were canied out for each of the selection procedures described above. 
DMA Sequen(dn9 of selected Dhaq^^ 

From the hGH and Glycine elution steps of each cycle, an aliquot of phage was used to inoculate XLI-Blue 
2 5 cells, which were plated on LB media containing carbenidllin and tetracycline to obtain independent clones from 
each phage pool. Single-stranded ONA was prepared from Isolated cotony and sequenced In the regton of the 
mutagenic cassette. The reajlts of DNA sequencing are summarized in temns of the deduced amino acM sequences 
In Figures 5, 6, 7, and 8. 

30 Expression and assay Of hGH mutants 

To detennine the binding affinity of some of the selected hGH mutants for the hGHbp, we transformed 
DNA from sequenced ctones into£j^ strain 16C9. As described above, this is a non-suppressor strain which 
tenninates translation of protein after the final Phe-191 resMue of hGH. Single-stranded DNA was used for 
these transformatk)ns, but double-stranded DNA or even whole phage can be easily electroporated into a non- 
35 suppressor strain for expression of free homtione. 

Mutants of hGH were prepared from osmotically shocked cells by ammonium sulfate precipitation as 
described for hGH (Olson et al., Mlu:&293, 408-41 1 [1981]), and protein concentrations were measi^-ed by 
laser densitomoetry of Coomassie-stalned SDS-polyacrylamide gel electrophoresis gels, using hGH as standard 
(Cunningham and Wells, SciSlU£244, 108M085 [1989]). 
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10 



15 



20 



25 



30 



35 



4Q 



45 



The binding affini^ of each mutant was determined by displacement of ^ 25| hGH as described (Spencer 
et al.. .1 Rinl-Chein. 263. 7862-7867 11988] ; Fuh et al., J- Blot. Cham. 265. 31 1 1-31 15 11990», using an anli- 
leceptor monoclonal antiboc^ (Mab263}. 

The results for a number of hGH mutants, selected by different pathways {Fig. 6) are shown in Table 
VIL Many of these mutants have a tighter binding affinity for hOHbp than wild-type hGH. The most improved 
mutant, KSW, has a bincSng affinity 5.6 times greater than that of wid-type hGH. The weakest selected 
mutant, among those assayed was only about lO'-fbId lower In binding affinity than hGH. 

Binding assays may be carried out for mutants selected for hPRLi)p-binding. 

Tat)teAni. 
Compeiitive bindng to hCHbp 

The selected pool in which each mutant was found is indicated as 1 G (first glycine s^edkm), 3G (third 
glycine selection), 3H (third hGH selection), 3" (third selecfion, not binding to hPRLbp, but binding to hGHbp). 
Thenumberof times each mutant occuned among all sec^nced clones is shown Q. 



Mutant 


Kd (nM) 


Kd(mut)/Kd(hGH) 


Pool 


KSYR (6) 


0.06 + 0.01 


0.18 


1G,3G 


RSFR 


0.10 + 0.05 


0.30 


3G 


RAYR 


0.13 + 0.04 


0.37 


3* 


fOYK (2) 


0.16 + 0.04 


0.47 


H.3G 


RSYR (3) 


0.20 + 0.07 


0.58 


1G,3H.3G 


KAYR (3) 


0.22 + 0.03 


0.66 


3G 


RFFR (2) 


0.26 + 0.05 


0.76 


3H 


KQYR 


0.33 + 0.03 


1.0 


3G 


KEFR= wt (9) 


0.34+ 0.05 


1.0 


3H,3G.3* 


RTYH 


0.68 + 0.17 


2.0 


3H 


QRYR 


0.83 + 0.14 


2.5 


3* 


KKYK 


1.1 +0.4 


3.2 


3* 


RSFS (2) 


1.1 +0.2 


3.3 


3G,* 


KSNR 


3.1 +0.4 


9.2 


3* 



Additive and noiv afMHive effects on btnding 

At some residues, substitution of a particular amino acid has essentially Ihe same effect independent of 
50 surrounding reiSdues. For example, substitution of F176Y in the background of 172R/174S reduces binding 
affinity by 2.0-fold (RSFR vs. RSYR). Simaarfy, in the baciground of 172K/174A the binding affinity of the 
F176Y mutant (KAYR) is 2.9-fbld weaker than flie corresponding 176F mutant (KAFR; Cunningham and Wells. 
1989). 
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On the other hand, the binding constants determined for several selected mutants of hGH demonstrate 
non-additive effects of some amino acid substitutions at residues 172, 174, 176, and 178. For example, in the 
background of 172K/176Y. the substitution E174S results in a mutant (KSYR) which binds hGHbp 3.7-fold 
tighter than the corresponding mutant containing E174A (KAYR). However, in the background of 172R/176Y, 
5 the effects of these El 74 substitutions are reversed. Here, the E174A mutant (RAYR) binds 1.5-fbld tighter 
than the E174S mutant (RSYR). 

Such non-additive effects on binding for substitutions at proximal resUues illustrate the utility of 
protein-phage binding selectton as a means of selecting optimized mutants from a library randomized at several 
positions. In the absence of detailed stmctura! information, without such a selection process, many combinattons of 
1 0 substitutions might be tried before finding the optimum mutant. 

EXAMPLE IX 

SELECTION OF hGH VARIANTS FROM A HEUX-1 RANDOM CASSEHE 
1 5 UBRARY OF HORMONE-PHAGE 

Using the methods described in Example VIII, we targeted another region of hGH involved in binding to 
the hGHbp and/or hPRLbp, helix 1 residues 1 0, 14, 18, 21 , for random mutagenesis in the phGHam-g3p vector 
(also known as pS0643; see Example VIII). 

20 We chose to use the "amber" hGH-gS construct (called phGHam-gSp) because it appears to make the 

target protein, hGH, more accessible for binding. This is supported by data from comparative ELISA assays of 
monoclonal antibody binding. Phage produced from both pS0l32 (S. Bass, R. Greene, J. A. Wells, Proteins 8, 
309 (1990).) and phGHam-g3 were tested with three antibodies (Medix 2, 1B5.G2, and 5B7.G10) that are known 
to have binding determinants near the cariMxyl-terminus of hGH [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 

25 SdenceTAZ, 1330 (1989); B. C. Cunningham and J. A. Wells, ScienceM, 1081 (1989); L Jin and J. WeUs, 
unpublished results], and one artf body (Medix 1 ) that recognizes determinants in helices 1 and 3 ([B. C. 
Cunningham, P. Jhurani, P. Ng, J. A. Wells, SdenceiAZ, 1330 (1989); B. C. Cunningham and J. A. Wells, Sdence 
244, 1081 (1989)]). Phagemid particles from phGHam-g3 reacted much more strongly with antibodies Medix 2, 
1B5.G2, and 5B7.C10 than did phagemid particles from pS0132. In partk:ular, binding of pS0132 particles was 

30 reduced by >2000-fbki for both Medix 2 and 5B7.C10 and reduced by >25-foki for 1B5.G2 compared to binding 
to Medix 1. On the other hand, binding of phGHam-g3 phage was weaker by only about 1 .5*fbld, 1 .2-fold, and 2.3- 
foki for the Medix 2, 1B5.G2, and 5B7.C10 antibodies, respectively, compared with binding to MEDIX 1. 
ConstructkMi of the helix 1 library bv cassette mutagenesis 

We mutated residues in helix 1 that were previously kjentified by alanine-scanning mutagenesis [B. C. 

35 Cunningham, P. Jhurani, P. Ng, J. A. Wells, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 
244, 1081 (1989), 15, 16) to modulate the binding of the extracellular domains of the hGH and/or hPRL 
receptors (called hGHbp and hPRLbp, respectively). Cassette mutagenesis was earned out essentially as 
described [J. A. Wells, M. Vasser, D. B. Powers, Gene 34, 315 (1985)]. This library was constructed by cassette 
mutagenesis that fully mutated four residues at a time (see Example Vill) which utilized a mutated version of 

4 0 phGHam-g3 into which unique KprH (at hGH codon 27)and Xho\ (at hGH codon 6) restriction sites (underlined 
betow) had been inserted by mutageneses [ T. A. Kunkel, J. D. Roberts, R. A. Zakour, hSethods Enzymol. 154, 367- 
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38q with the oKgonucleoticfes 5'-GCC TTT 6AC AfifiJACJJAG GAG TTT G-S" and ff-CCA ACT ATA CCA 
R TCTCGAfiS TCT ATT CGA TAA C-3*, respectively. The later ol^o also itilnoduced a +1 frameshift 
(ilalcfzed) to teimmate translation from the starting vector and minimize wild-type background in the phagemid 
litKary. This strafing vector was designated pH0508B. The heDx 1 ibrary, which mutated hGH residues 1 0, 14, 
5 t8. 21. was constructed by ligaHng to the large Xho\-Kpn\ fragment of pH0508B a cassette made from the 
complementary ofigonucleotides 5"-pTGG AGG CTC NNS GAG AAC GCG NNS GTG CGT GOT NNS CGT GIT 
NNS GAG GTG GGG TTT GAG AGG TAG-3' and 5'-pGT GTG AAA GGG CAG GTG SNN AAG AGG SNN AGG 
AGG GAG SNN GGG GTT GTG SNN GAG GG-3'. The KprA site was desfroyed in the junction of the ligation 
product so that restriclion enzyme digestion could be used for analysis of non-mutated background. 
10 The library contained at least 10^ independent transfbrmants so that if the library were absolutely 

random {1 0^ different combinalions of codons) we would have an average of about 1 0 copies of each possible 
mutated hGH gene. Restriction analysis using Kpi* Indicated that at least 80% of helix 1 library constructs 

contained the ins^led cassette. 

B&iding enrichments of hGH-phagefirom the ibraries was carried out using hGHbp immobilized on 

15 oxiranei)olyacrylamklebeads(SigrnaChemkalCo.)asdascribed(ExampleVIII). Four residues in helx 1 (F10. 
MHr H18. and H21) were similariy mutated and after 4and 8 cycles a no^^wiW-type consensus developed (Table 
VIII). Position 10 on the hydrophobic face of helix l tended to be hydrophobic whereas posittons 21 and 18 on the 
hydrophlBic fece fended wrere dominated by Asn; no obvious consensus was evkfent for position 1 4 (Table IX). 
The bindmg corstants for these mutants of hGH to hGHbp was determined by expressing the free 

20 hormone variants in the non-suppressor £ co/r strain 16G9. purifying the protein, and assaying by competitive 
displacement of labelled wt-iiGH from hGHbp (see Example VIII). As indtcated, several mutants bind tighter to 
hGHbp than doeswt-hGH. 
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Table VIII. 
Selection of hGH helix 1 mutants 

Variants of hGH (randomly mutated at residues F10, M14, H18, H21) expressed on phagemid 
5 particles were selected by binding to hGi-itsp-beads and eluting witti liGiH (0.4 miU) buffer 
foliowed by glycine (0.2 M, pH 2) buffer (see Example VIII). 



Gly elution 



FIO 


M14 


H18 


H21 




4 Cycles 
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Table IX 

Cosensusssquencestamlheseleciedhelixflilxary 

5 Observed frequerKy is fraction of all (tones setyjeixed vBth the ire The nominal frequency Is 

calcuialed on the basis of NNS 32 codon degeneracy. The maximal enrichment factor variesfrom 11 to 32 
depending upon the nominal frequency value for a given residue. Values of |Kd(Ala mut)/Kd{wt hGH)] Ibr single 
alanine mulationswerB taken from BX.CunningharaandJ.A.Welis, S(*/Jce244J08t(1989)rB.C.Cirnir»gte^^ 
D. J. Henner, J. A. VWells. Science 247. 1461 (1990); B. G. Cunningham and J. A. Wells, Pmc, NatL Acad. Scl USA 
10 88,3407 (1991). 
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Table X 

BJndIng of purified hGH helix 1 mutams lo hGHbp 

5 Competition binding experiments were perfomied using [^^^QhGH (wild-type), hGHbp (containing the 
extracellular receptor domain, residues 1-238). and Mab263 [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Welts, 
Science 243, 1 330 (1989)];. The number P Indicates the fractional occurrence of each mutant among all the clones 
sequenced after one or more rounds of selection. 
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EXAMPLE X 

SELECTION OF hGH VARIANTS FROM A HELIX-4 RANDOM CASSEHE UBRARY CONTAINING 
PREVIOUSLY FOUND MUTATIONS BY ENRICHMENT OF HORMONE-PH AGE 

30 Design nf mutant nmtelns wHh Improved MndlnQ prpnertles bv iteralive setectfon using hormonMhaae 

Our experience with recruiting non4>inding homologs of hGH evolutionary variants suggests that many 
individual amino acM substitutions can be combined to yield cumulatively improved mutants of hGH with respect to 
binding a particular receptor [B. 0. Cunningham, D. J. Henner. J. A, Wells, Science 247, 1461 (1990); B. C. 
Cunningham and J. A. Wells, Proc. NatL Acad Sa. USA 88, 3407 (1991); H. B. Lowman, B. C. Cunningham, J. A. 

35 Wells, J. BioL Chem. 266, in press (1991)]. 

The helix 4b library was constructed in an attempt to further improve the helix 4 double mutant 
(E174S/F176Y) selected from the helix 4a library that we found bound tighter to the hGH receptor (see 
Example VIII). With the E174S/F176Y hGH mutant as the bacl^ground starting hormone, residues were mutated 
that sun-ounded positions 174 and 176 on the hydrophilic face of helix 4 (R167, 0171,7175 and 1179) . 

40 Construction of the helix 4b library bv cassette mutagenesis 

Cassette mutagenesis was canied out essentialy as described [J. A. Wells, M. Vasser, D. B. Powers, 
Gene34, 315 (1985)]. The helix 4b library, which mutated residues 167, 171, 175 and 179 within the 
E174S/F176Y baclcground, was constructed using cassette mutagenesis that fully mutated four residues at a 
time (see Example VIII) and which utilized a mutated version of phGHam-g3 into which unique SsfBI and Bgl{\ 
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restricfion sites had been inserted previously (Example Vlll). mio the BsiB\-m sites of the vectorwas inserted 
a cassette marie from the complementary oligonudeolides 5'-pa TTA CTC TAC TGC TTC NNS AAQ GAC ATG 
NNS AAG GTC AGC NNS TAC CTGCGC NNS GTG CAG TGC A-3' and 5'-pGA TCTGCA CTG CAC SNN 
GCG CAG GTA SNN GCT GAC CTT SNN CAT GTC CTT SNN 6AA GCA GTA GA-3'. The BstBl site was 
5 elmfrated in the ligaled cassette. FromtheheOx4bBbrary,15unselecteddonesweresequenced. Oflhese.none 
laded a cassette insert, 20% were frame-shifted, and 7% had a non-silent mutation. 
BffMiHtt rfhfiHhn aiffchment 

Binding enrichments of hGH-phage from the lilxaries was carried out using hGHbp immobilized on 
oxirane-polyacrylamide beads (Sigma Chemical Co.) as described (Example VIII). After 6 cydes of binding a 
1 0 reasonably dear consensus developed (Table XI). Interestingly, all positions tended to containpolar residues, 
notably Ser, Thr and Asn (XII). 
Assay of hBH mutants 

The binrftig constants for some of these mutants of hGH to hGHbp was detennlned by expresSng the 
free hormone variants in the non-suppressor £ coff strain t6C9, purifying the protein, and assaying by competitive 
15 displaceraent of labelled wt-hGH from hGHbp (see Example VIII). As indicated, the binding alfinilies of several 
heio(-4b mutants for hGHbp were fighter than that of wt-hQH Table XIII). 
Roffoptflr-MieeUvthrof hGH variants 

Finaify, we have begun to analyze flie binding affinity of several of the tighter hGHbp binding mutants 
for their abili^ to bind to the hPRU)p. The E174S/F176Y mutant binds 200-fold weaker to the hPRLbp than 
20 hGH. The E174TyF176Y/R178K and R167N/D171S/E174S/F176Y/1179T mutants each bind >500-fOld weaker 
to the hPRLbp than hGH. Thus, it is possible to use the produce new receptor seledive mutants of hGH by phage 
displaytechnoiogy. 

Hnimnnfl-Hhaaemld sek ^tinn fdenffles ttw Infrvmalion-cantPnt of narticular residues 

Of the 12 residues mutated in three hGH-phagemid libraries (Examples Vlll, IX, X), 4 showed a strong, 

25 although not exclusive, consenration of the wild-type residues (K172, T175. F176, and R178). Notsurpriangly. 
these were residues that when converted to Ala caused the ia^st disrupfions (4- to 60-fold) in binding affinity 
to the hGHbp. There wasadass of 4 other residues (F10, Mt4, D171. and H79) where Ala subsBtutions caused 
weaker effects on binding (2- to 7-fold) and these positions exhibited little wild-type consensus. Rnally the other 
4 residues (H18, H2t, R167, and E174), that promrrte binding to he hPRlip but not ttie hGHbp, tfd not exhibit 

30 anyconsensus for the mW-^ hGH sequence by selection on hGHbp^BSKfs. In facttwo residues (E174 and 
H2t), where Ala substitulfons enhance binding afiinity to the hGHbp by 2- to 4-fold p. C. Cunningham, P. 
Jhurani, P. Ng. J. A. Wells. Science 243, 1330 (1989); B. C. Ciiiningham and J. A. Weils, Scrence 244, 1081 
(1989); B. C. Cunningham,0. J. Henner, J. A. Wells, Science2A7, 1461 (1990); B. C. Cinningham and J. A. Wells, 
Pmc NaS. Acad. Sci. USA 88, 3407 (1991)]. Thus, the alanine-scanning mutagenesis data correlates reasonably 

35 welt virith the flexibility to substitute each position. In fact , the reduction in binding affinity caused by alanine 
substMons IB. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, Science 243, 1330 (1989); B. C. Cunningham and 
J. A. Wells, Sdencem. 1081 (1989)], B. 0. Cunningham, D. J. Henner, J. A. Wells, Scence 247, 1461 (1990); B. 
C. Cunningham and J. A. Wells, Proc. Natl. Acad. Sci. USA 88. 3407 (1991 )] is a reasonable predictor of the 
percentage that the wild-type residue is found in the phagemid pool after 3^ rounds of selection. The alanine- 
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scanning information is usefui for targeting side-chains that modutate tending, and the phage selection is 
appropriate for optimizing them and defining the flexit)ility of each site (and/or combinations of sites) for 
sulKtitulion. The combination of scanning mutational methods [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 
Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Sdence 244, 1081 (1989)] and phage display is a 
5 powerful approach to designing receptor-llgand interfaces and studying molecular evolution in vitro. 
Variations on iterative enrichment of hornione-DhaaemM Ubraries 

In cases where combined mutations in hGH have additive effects on binding affinity to receptor, 
mutations learned through hormone-phagemid enrichment to improve binding can be combined by simple cutting 
and ligafion of restriction fragments or mutagenesis to yield cumulatively optimized mutants of hGH. 

10 On the other hand, mutations in one region of hGH which optimize receptor binding may be stmcturally or 

functionally incompatibie with mutations in an overlapping or another region of the molecule. In these cases, 
hormone phagemid enrichment can be canried out by one of several variations on the iterative enriOunent approach : 
(1) random DNA libraries can be generated In each of two (or perhaps more) regions of the molecule by cassette 
or another mutagenesis method. Thereafter, a combined library can be created by ligation of restriction 

1 5 fragments from the two DNA libraries; (2) an hGH variant, optimized for binding by mutation in one region of the 
molecule, can be randomly mutated in a second region of the molecule as in the helix-4b library example; (3) two or 
more random libraries can be partially selected for improved binding by hormone-phagemid enrichment; after this 
'roughing-in' of the optimized binding site, the still*partially-diverse libraries can be recombined by ligation of 
restriction fragments to generate a single library, partially diverse in two or more regions of the molecules, which 

20 in turn can be further selected for optimized binding using hormone-phagemid enrichment. 
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Table XI. 

Mufamphasemidsof hGH selecfedf^ 

Selecfion of hGH helix 4b mutants (randomly mutated at residues 167, 171 , 1 75, 179). each containing the E174S/F176Y 
double mulant. by binding to hGHbp-beads and eluting with hGH (0.4 mW) buffer followed by glycine (0.2 M, pH 2) buffer. 
5 One mutant (+) contained the spurious mutation R178H. 
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Table XII 

Consensus sequences Inm the selecM 
Observed frequency is fraction of all clones sequenced with the indicated amino add. The nominai frequency 
is calculated on the basis of NNS 32 codon degeneracy. The maximal enrichment factor varies from 11 to 1 6 to 32 
5 depending upon the nominal frequency value for a given residue. Values of {Kcj(Ala mut)/K(j(wt hGH)) for single 
alanine mutations were taken from refs. below; for position 1 75 we only have a value for the T1 75S mutant [B. C. 
Cunningham. P. Jhuranl. P. Ng, J. A. Wells, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Sdence 244, 
1081 (1989); B. 0. Cunningham, D. J. Henner, J. A. Wells, Science2A7, 1461 (1990); B. C. Cunningham and J. A. Weils, 
Proc. NatL Acad ScL USA 88, 3407 (1991).]. 
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Table xia 

Binding of pumed hGH mulants to hGHbp. 

Competition binding experiments were perfonned using [^^IJIiGH (wild-type), hGHbp (contanning the 
30 extracelular receptor domain, residues 1-238). and Mab263 (11). The number P indicates the fractbnal 

occurrence of each mutant among all the clones sequenced after one or more rounds of selection. Note that the 
helix 4b mutations (*) are in the background of hGH(E174S/F176Y]. In the list of helix 4b mutants,, the 
E174S/F176Y mutant 0, with wt residues at 167. 171 , 175. 179. is shown in bold. 
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Assenfei^or Fdb Moieculeonfhe 

ConsitueiiQnofDiasmkIs 

Plasmid pDH 188 contains the DNA encoding the Fab po^'on of a humanized IgG antibody, called 4E)5, 
5 that recognizes the HER-2 receptor. This plasmid is contained in £ coff strain SR 101 . and has been deposited 
with the ATCC in Rockville. MD, 

Briefly, the piasmid was prepared as follows: the starting piasmid was pS0132. containing the alkaline 
phosphatase promoter as described above. The DNA encoding human growth hormone was exdsed and, after a 
series of manipulations to make the ends of the plasmid compatible for ligation, the DNA encoding 4D5 was 

1 0 inserted. The 4D5 DNA contains two genes. The first gene encodes the variable and constant regions of the light 
chain, and contains at its 5' end the DNA encoding the st li signal sequence. The wcond gene contains lour 
portions: first, at its 5* end is the DNA encoding the st II signal sequence. TWs is followed by the DNA encoding 
the variable domain of the heavy chain, whfch Is foltowed by the DNA erwxling the first domain of the heavy chain 
constantregton,whichfalurnisfoitowedbytheDNAencodngtheM13gere III. The salient features of this 

15 constmctareshowninFeurelO. The sequence of the DNA enco(«ng4D5 Is shown in Fgure 11. 
Pcflr/ttansfarmaHnnaivi^ 

Both polyethylene glycol (PEG) and electroporation were tjsed to transform plasmids into SR101 ceils. 
(PEG competent cells were prepared and Iransfonned according to the method of Chung and Miller [Nucleic 
Atids ftes. 163580 [1988)). Cells that were competent for electroporatton were prepared, and subsequently 

20 transformed via eleclroporaBon according to the method of Zabarovsky and Winberg (A/uc/e/c Mds Res, 18S912 
[1990]). After pfacing the ceils in 1 ml of the SOC media (described in Sambrook etal., supra), they were grown 
for 1 hour at 37°C with shaking. At this time, the concentration of the cells was detemiined using light scattering 
at ODeoo- A titered K07 phage stock was added to achieve an multiplicity of infecton (MOI) of 100. and the 
phage were allowed to adhere to the cells for 20 minutes at room temperature. This mixture was then diluted into 

25 25 mis of 2YT broth (described in Sambrook ef al., supra) and incubated with shaking at 37*C overnight. The 
next day, ceils were pelleted by centrifugation at 5000 xg for 10 minutes, the supematentwas collected, and the 
phage particles were pretfpitated with 0.5 M NaCI and 4% PES (final concentration) at room temperature for 10 
minutes. Phage particles were pelleted by centrifugatton at t0.000 xg for 10 minutes, resuspended In 1 ml of TEN 
(10 mMTris, pH 7.6, 1 mM EDTA, and 150 mM NaCI), and stored at 4^C. 

30 Pmductfan of an tigen coated Plates. 

Alkiuots of 0.5 ml from a solutton of 0.1 mg/ml of the extra-cellular domain of the HER-2 antigen (ECD) 
or a solution of 0.5 mg/ml of BSA (control antigen) in 0.1 M sodium bfcariaonate, pH 85 were used to coat one well 
of a Falcon 12 well tissue culture plate. Once the solution was applied to the wells, the plates were incubated at 
4"»Con a rocking platform ovemight. The plates were then bk>cked by removing the initial solution, applying 0.5 ml 

35 of blocking buffer (30 mg/ml BSA in 0,1 M sodium bicarbonate), and incubating at room temperature for one hour. 
Finally, the btocking buffer was removed, 1 ml of buffer A (PBS, 0.5% BSA, and 0.05% Tween-20) was added, 
and the plates were stored up to 10 days at 4X before being used for phage selection. 
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Phage fietecflonprncess. 

Approxknateiy 10^ phage particles were mixed with a 100-fold excess of K07 helper phage and 1 ml of 
buffer A . This mixture was divided into two 05 ml aliquots; one of which was applied to ECD coated wells, and 
the other was applied to BSA coated wells. The plates were Incubated at room temperature while shaking for one 
5 to three hours, and were then washed three times over a period of 30 minutes with 1 ml aliquots of buffer A. 
Elution of the phage from the plates was done at room temperature by one of two methods: 1 ) an initial overnight 
incubation of 0.025 mg/ml purified Mu4D5 antibody (murine) followed by a 30 minute Incubation with 0.4 ml of 
the add elution buffer (0.2 M glycine, pH 2.1 , 0.5% BSA, and 0.05% Tween-20), or 2) an incubation with the acid 
elution buffer alone. Eluates were then neuTalized with 1 M Iris base, and a 05 mi aliquot of TEN was added. 
1 0 These samples were then propagated, titered, and stored at 4"^:. 

praggpropMaBon 

Aliquots of eluted phage were added to 0.4 ml of 2YT broth and mixed with approximately 10^ mid-tog 
phase male E co// strain SR101. Phs^ were allowed to adhere to the cells tor 20 minutes at room temperature 
and then added to 5 ml of 2YT broth that contained 50 ug/ml of carbenicillin and 5 |ig/mi of tetracycline. These 
1 5 cells were grown at 37''C tor 4 to 8 hours until they reached mid-log phase. The ODeoo was determined, and the 
cells were superinfected with K07 helper phage for phage production. Once phage particles were obtained, they 
were titered in order to determine the number of colony forming units (cfu). This was done by taking aliquots of 
serial dilutions of a given phage stock, allowing them to infect mid-log phase SR101 , and plating on LB plates 
containing 50 vglm\ carbenicillin. 

20 Ri A affinity tietermlnatlQn, 

The affinity of h4D5 Fab fragments and Fab Phage for the ECO antigen was detemiined using a 
competitive receptor binding RIA (Burt, D. R., Recef^or Binding in Dmg Research. O'Brien, R.A. (Ed.), pp. 3- 
29, Dekker, New York [1986]). The ECD antigen was labeled with ^^S.iodjne using the sequential chtoramine-T 
method (De l^co, J. E et al., J. Ceil Phy^oL 109:143-152 [1981]) which produced a radioactive tracer with a 

25 specific activity of 14iiCi/|ig and incorporation of 0.47 moles of Iodine per mole of receptor. A series of 0.2 ml 
solutions containing 0.5 ng (by ELISA) of Fab or Fab Pt^9e> SO nCi of ^^1 ECD tracer, and a range of unlabeled 
ECD amounts (6.4 ng to 3277ng) were prepared and incubated at room temperature overnight The labeled 
ECD-Fab or ECD-Fab phage complex was separated from the unbound labeled antigen by fomiing an aggregate 
complex induced by the addition of an anti-human IgG (Fitzgerald 40-GH23) and 6% PEG 8000. The complex was 

3 0 pelleted by centrifugation (1 5,000 x g for 20 minutes) and the amount of labeled ECD (in cpm) was determined 
by a gamma counter. The dissociation constant (K<j) was cateulated by emptoying a modified version of the 
program LIGAND (Munson, P. and Rothbard, D., Anal Blochem, 107220-239 [1980]) which utilizes Scatchard 
analysis (Scatchard, G.,Ann. NX Acad. Sri. 51.S60-672 [1949]). The Kd values are shown in Figure 13. 
Competitive cell bindinq assav 

35 Murine 4D5 antibody was labeled with 125-1 to a specific activity of 40-50 (iCi/iLig using the todogen 

procedure. Soluttons containing a constant amount of labeled antibody and increasing amounts of unlabefed variant 
Fab were prepared and added to near confluent cultures of SK-BR-3 cells grown in 96-weli microtiter dishes 
(final concentration of labeled antibody was 0.1 nM). After an overnight incubation at 4''C, the supernatant was 
removed, the cells were washed and the cell associated radtoactivity was determined in a gamma counter. Kd 
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values were detemlned by analyzii^ the data using a modfied version of the program LIGAND (Munson, P. and 
Ridhbard. D., supra} 

This depMit of piasmid pDH188 ATCC no. 68663 was made under the provisions of the Budapest 
Treaty on the Internal onal Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure 
5 and the ReguIatiorB thereunder (Budapest Treaty). This assures maintenance of a viable culture for 30 years 
from the date of deposit The organisms will be made available by ATCC under the temis of the Budapest 
Treaty, and sutjject to an agreement between Genentech. Inc. and ATCC. which assires pemianent and 
unrestricted availability of the progeny of the cultures to the public upon issuance of the pertinent U.S. patent or 
upon laying open to the public of any U. S. or foreign patent application, whichewercomes first, and assures 

10 availability of the progeny to one determined by the U.S. Commissioner of Patents and Trademari^ 

thereto according to 35 USC §1 22 and the Commissioners rules pursuant thereto (Induding 37 CFR §t.l4 with 
particular reference to 886 OS 638). 

The assignee of the present applfcafion has agreed that If the cutures on deposit should die or be lost 
or destroyed when culfivaled under suitable conditions, Ihey will be promptly replaced on notiicalionwith a viable 

15 specimen ofthe same culture. Availability of the deposited cultures is not to be constmed as a license to pracBce 
the rnvenlion in contravention of the rights granted under the authority of any government in accordance with its 
patent laws. 

The foregoing written specifKation is consWered to be suffia'ent to enable one skilled in the art to 
pracfice the invention. The present invenSon is not to be limited in scope by the cultures deposited, since the 

20 deposited embodiments are intended as separate illustrations of certain aspects of the invention and any cultures 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be constmed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modiflcafions of the irwention in addition to 

25 those shown and described herein will become apparent to those skilled in the art from the foregoing description 
and fall within the scope of the appended dalras. 

While fte iroenffon has necessarily been described in conjunction with preferred embodiment, one of 
ordinary sidll, after reading the foregoing spedlicafon, wll be able to effect various changes, substitutions of 
equivalents, and alterations to the subject matter set forth herein, without departing from the spirit and scope 

30 lf»reof. Hence, the invenfion can be practiced In ways other than those speclficalty described herein, ft is 

therefore intended that the protection granted by Letters Patent hereon be limited only by the appended claims 
arxl equivalents thereof. 
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EXAMPLE XII 

SELECTION OF hGH VARIANTS FROM COMBINATIONS OF HELIX-1 AND HEUX-4 HORMONE-PHAGE 

VARIANTS 

5 ConstnieHonoffadrfiflwyartants 

According to acMitivit/ principles [J. A. Wells, BiochemistryZB, 8509 (1990)], mutations In different 
parts of a protein, if they are not mutually interacting, are expected to comt)ine to produce additive changes in the 
free energy of binding to another molecule (changes are additive in terms of AAGunding* or multiplicative in 
terms of Kd exp(-AG/RT] ). Thus a mutation producing a 2-fold increase in binding affinity, when combined 

1 0 with a second mutation causing a S-fold increase, would be predicted to yield a double mutant with a 6-fold 
increased affinity over the starting variant. 

To test whether multiple mutations obtained from hQH-phage selections would produce cumulatively 
favorable effects on hGHbp (hQH-binding protein; the extracellular domain of the hGH receptor) binding, we 
combined mutations found in the three tightest-binding variants of hGH from the helix-1 library (Example IX: 

1 5 F10A/M14W/H18D/H21N, F10H/M14G/H18N/H21N, and F10F/M14S/H18F/H21L) with those found In the 
three tightest binding variants found in the helix-4b library (Example X: R167N/0171S/ri75/l179T, 
R167E/D171S/T175/I179, and R167N/D171N/T175/I179T). 

hGH-phagemid double-stranded DNA (dsDNA) from each of the one-helix variants was isolated and 
digested with the restriction enzymes EcoRi and BsM. The large fragment from each heiix-4b variant was then 

20 isolated and ligated with the small fragment from each helix-1 variant to yield the new two-helix variants shown in 
Table XIII. All of these variants also contained the mutations E174S/F176Y obtained in earlier hGH-phage binding 
selections (see Example X for details). 
Construction of selective comblnatoria! libraries of hGH 

Although additivity prindples appear to hold for a number of combinations of mutations, some 

2 5 combinations (e.g. El 74S with F1 76Y) are clearly non-additive (see examples VIII and X). In order to identify 

with certainty the tightest binding variant with, for example, 4 mutations in helix-l ^Qd 4 mutations in helix-4, one 
would ideally mutate all 8 residues at once and then sort the pool for the globally tightest binding variant. 
However, such a pool would consist of 1.1 x 10^^ DNA sequences (utilizing NNS codon degeneracy) encoding 2.6 x 
10^0 different polypeptides. Obtaining a random phagemid library large enough to assure representation of all 

30 variants (pertiaps 10^3 transfonnants) is not practical using current transfonnation technology. 

We have addressed this difficulty first by utilizing successive rounds of mutagenesis, taking the 
tightest binding variant from one library, then mutating other residues to further improve binding (Example X). 
in a second method, we have utilized the principle of additivity to combine the best mutetions from two 
independently sorted libraries to create multiple mutants with improved binding (described above). Here, we 

35 further searched through the possible combinations of mutations at positions 10, 14, 18, 21 , 167, 171, 175, and 
179 in hGH, by creating combinatorial libraries of random or partially-random mutants. We constnicted three 
different combinatorial libraries of hGH-phagemids, using the pooled phagemids from the helix 1 library 
(Independently sorted for 0, 2, or 4 cycles; Example IX) and the pool from the heiix-4b library (Independently 
sorted for 0, 2, or 4 cycles; Example X) and sorted the combined variant poo! for hGHbp binding. Since some 

40 amount of sequence diversity exists in each of these pools, the resulting combinatorial library can explore more 
sequence combinations than what we might constnjct manually (e.g. Table Xlil). 
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hGH-phagefflW douWe^tranded DNA (dsDN^ ftom «ach of the one-helix libiafy pools (selected for 0, 
2. or4 rounds) was isolated and digested with the lestiiction enzymes Acd and BstXl. The large fragment from 
each helx-1 variant pool was then isolated and Bgated wilh the small fragment from each helix-4b variant pool to 
yield ttie three ooml»atocSal llxaries pH0707A (unselected helix i and helix 4b pools, as described in examples IX 
5 and X). pH07(J7B (twice-selected hellx-1 pool with twice-selected helix-4b pool), and pH0707C (4-times 

selected helix-l pool vnth 4-times selected helix-4b pool). Duplicate ligations were also set up with less DNA and 
designated as pH0707D, pH0707E. and pH0707F. corresponding to the 0-,2-, and 4-round starting libraries 
respectively. All of these variant pools also contained the mutations E174S/F176Y obtained in earler hGH-phage 
binding selections (see Example X fordeteils). 
10 Snrfnn enmblnat nrial llbraHes of hRH^e variants 

The ligatron products pH0707A-F were processed and electro-transformed Into XLI-Blue cells as 
described (Example VIII). Based on colony-forming units (CFU), the number (rf tansfbmiants obtained fifom each 
pool was as fbfcws: 2.4x106 from pH0707A. 1.8xl06 fix>m pHOTOTB, 1.6x106 from pH0707C, SxloS from 
PH07D7D. 3xlo5 frwn pH0707E, and 4x105 from pH0707F. hGH-phagemid parfidas were prepared and 
1 5 selected for hGHfap-Wndingover 2 to 7 cycles as described in Exampte VIB. 
BapM Mrtng of HftH-phanimM Hhraites 

fn addition to sorting phagemid libran'es for tight-binding protein variants, as measured by equilibrium 
binding affinity, it is of interest to sort for variante which are altered in either the on-rate (l<on) or the off-rate 
(koff) of binding to a receptor or other molecule. From thermodynamics, these rates are related to the 
20 equilibrium dissociation constent. Kd = (koff/kon)- We envision that certain variants of a particular protein have 
similar Kd's for binding white having very different kon's and koff's. Conversely, changes in Kd from one variant to 
another may be due to effects on kon, effects on koff, or both. The pharmacological properties of a protein may 
be dependent on binding affinity or on kon or koff. depending on the detailed mechanism of adon. Here, we sought 
to ktentify hGH variante with higher onn^es to investigate the effects of changes in ken- We envision that the 
25 selectton could alternatively be weighted toward koff by increasing the binding time and increasfrig the wash bne 
and/or concentration with cognate ligand (hGH). 

From time^auise analysis of wikJ-type hGH-phagemW binding lo immobilized hGHbp, it appears that, 
of the totel hGH-ph^id partfcles that can be eluted &i the final pH 2 wash (see Exampte VIII for the complete 
binding and elution protocol), less than 10% are bound after t tninute of incubatton, white greater than 90% are 
30 bound after 15 m&iutss of fricubatfon. 

For •raprd43inding selecfion," phagemid parlictes ftom the pH0707B pool pwfce-setected for helices 1 
and4independently)were incubated wittiimmobiized hGHbp for onlyl minute, then washed sixtmeswitt^ 1 mLof 
trindir^ buffen the hGH-wash step was omitted; and the remaining hGH-phagemid partictes were eluted with a 
pH2 fl).2M glycine in binding buffer) wash. Enrichment of hGH-phagemid particles over non-displaying particles 
35 indicated that even with a short binding period and no cognate-ligand (hGH) chaltenge, hGH-phagemW binding 
selection sorts tight-binding variants out of a randomized pool. 
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fSaSN P* nwrtants 

The binding constants for some of these mutants of liGH to trGHtip was detennined by expressing the 
free hormone variants in the non-suppressor £ co// strain 16C9 or 34B8, purifying the protein, and assaying by 
competitive displacement of labelled wt-hGH from hGHbp (see Example VIII) in a radio-immunopredpitation assay. 
5 In Table XIII -A below, all the variants have glutamatei74 replaced by serinei74 and phenylalanineijg replaced by 
tyrosinei76 (E174S and F1176Y) plus the additional substltullons as hdicated at hGH amino add posllions 10, 
14. 18, 21, 167, 171, 175 and 179. 



10 



wild-type residue: 



Table Xlll-A 

hGH variants from addWon of hellx>l and heax<4b najlations 



Helix 1 

F10 



Helix4 



H1B H21 R167 D171 T175 1179 





Variant 


















15 


H0650AD 


H 


G 


N 


N 


N 


S 


T 


T 




H0650AE 


H 


G 


N 


N 


E 


S 


T 


1 




H0650AF 


H 


G 


N 


N 


N 


N 


T 


T 




H0650BD 


A 


W 


D 


N 


N 


S 


T 


T 




H0650BE 


A 


W 


D 


N 


E 


S 


T 


1 


20 


H0650BF 


A 


W 


D 


N 


N 


N 


T 


T 




H0650CO 


F 


S 


F 


L 


N 


S 


T 


T 




H0650CD 


F 


S 


F 


L 


E 


S 


T 


1 




H0650CD 


F 


S 


F 


L 


N 


N 


T 


T 



25 In Table XIV below, hGH variants were selected from combinatorial libraries by the phagemid binding 

selectton process. All hGH variants in Table XIV contain two background mutations (E174S/F176Y). tiGH- 
phagemid pools from the libraries pH0707A {Part A), pH0707B and pH0707E (Part B), or pH0707C (Part C) 
were sorted for 2 to 7 cycles for binding to hGHbp. The number £ indicates the fractional occurrence of each 
variant type among the set of clones sequenced from each pool. 
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Table XIV 

tiGKvaiaibfnmiiomion&phageniidbin^ 



10 



15 



20 



25 



30 



35 



40 



wild-<type residue: 

_E Variant 
Part A: 4 cycles: 

0.6Q H0714A.1 



HerK4 



£ia MH ma azi bisz qizi iis Jia 



0.40 
Part B: 

0-13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 

0.17 
0.17 
0.17 
0.17 
0.17 
0.17 

0.57 
0.14 
0.14 
0.14 
0.50 
Part C: 

0.67 



H0714A.4 

ZcycSes: 

H0712B.1 

H0712B.2 

H0712B.4 

H0712B.5 

H0712B.6 

H0712B.7 

H0712B.8 

H0712E.1 

H0712E.2 

H0712E.3 

H0712E.4 

H0712E.5 

H0712E.6 

H0712E.7 

H0712E.8 

4 cycles: 

H0714B.1 

H0714B.2 

H0714B.3 

H0714B.4 

H0714B.5 

H0714B.6 

7cycies: 

H0717B.t 

H0717B.2 

H0717B.6 

H07t7B.7 

H0717E1 

4cydes: 
H0714C.2 



F 
H 
H 
F 
A 
Q 
W 
F 
W 
A 
P 
H 
F 
M 
F 

A 
A 
M 
H 



A 

F 



G 



S 
Q 
G 
S 
S 
Y 
G 
L 
N 
N 
S 
G 
S 
T 
S 

W 
W 
Q 
Y 
N 
N 

W 
S 
W 
Q 
S 



F 
T 
N 
F 
T 
N 
S 
S 
N 
A 
D 
N 
T 
S 
F 

D 
D 
M 
D 
S 
S 

D 
T 
D 
E 
L 



N 
A 



G 
S 
N 
L 
N 
N 
S 
S 
S 
S 
N 
N 
G 
N 
L 

N 
N 
N 
H 
H 
H 

N 
G 
N 
H 
A 



N 
N 



H 
A 
N 
S 
R 
H 
R 
K 
H 
N 
R 
N 
R 
Q 
T 

R 
H 
N 
R 
R 
T 

N 
R 
R 
N 
N 



S 
N 



S 
D 
A 
D 
D 
S 
D 
N 
S 
S 
D 
N 
D 
S 
S 

D 
S 
S 
D 
D 
S 

A 

D 
D 
S 
S 



T 
T 



T 
N 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 
T 

T 
T 
T 
T 
T 
T 

T 
T 
T 
T 
T 



N 



T 
S 
T 
T 
I 

T 
I 

V 

T 
T 
I 

S 
I 

T 
S 

I 

N 

T 
T 
I 

T 

T 

I 

I 

T 
V 



45 * = also contained the mutations L15R, K168R. 



In Table XV below, hGH variants were selected from combinatorial libraries by ttie phagemid binding 
selection process. Al hGH variants in Table XV contain two bactcground mutatons (E174S/F176Y). Ttie number 
50 E Is *ie fractional occurrence of a given \ariant among all dones sequenced after 4 cycles of rapid-binding 
selection. 
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Table XV 

hGH vailaiMs fton RAPID hGHbp binding selection of an hGH-phagemid combinatorial library 



5 








Hefixl 








Helix 4 








wild-type 


residue: 


m 


MU 


HIS 


m 


R167 


D171 


T175 


1179 






Variant 




















0.14 


H07BF4.2 


W 


6 


S 


s 


R 


D 


T 


1 


10 
























0.57 


H07BF4.3 


M 


A 


D 


N 


N 


S 


T 


T 




0.14 


H07BF4.6 


A 


W 


D 


N 


S 


S 


V 


Tt 


15 


0.14 


H07BF4.7 


H 


Q 


T 


s 


R 


D 


T 


1 



t » also contained the mutation Y176F (wild-type hGH also contains F176). 



In tai>le XVI k)elow. binding constants were measured t)y competitive displacement of 125|-iabelled 
hormone H0650BD or labelled hGH using hGHbp (1-238) and either MabS or Mab263. The variant H0650BD 
appears bind more than 30-fold tighter than wild-type hGH. 
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Table XVI 



Equilibrium bimaiYl constants of selectBd hGH vaif^ 



5 


hGH 
Variant 


Kdfvariantt 
Kd(H0650BD) 


Kd(vananti 
Kd(h6H} 


Kd (pMl 




hGH 


32 


-1- 


340 ± 50 


10 


H0650BD 


-1- 


0.031 


10 ±3 




H0650BF 


\S 


0.045 


15 ±5 




H0714B.6 


3.4 


0.099 


34± 19 


15 


H0712B.7 


7.4 


0.22 


74 ± 30 




H0712E2 


16 


0.48 


60 ±70 



EXAMPLE XIII 



Selective enrichmentof hGIH)liage containing a protease substrate sequence versus noi>«ubsirate phage 

25 As described in Example I. the plasmid pS0l32 contains the gene lor hGH fused to the residue Pto198 of Ihegene 
III protein with the insertion of an extra glycine residue. This plasmid may be used to produce hGH-phage particles 
in which the hGH-gene III fusion productis displayed monovalently on the phage surface (Example IV). The fusion 
protein comprises the entire hGH protein fused to the carboxy terminal domain of gene III via a flexible inl«r 
sequence. 

30 To investigate the feasibility of using phage display technology to select favourable substrate 

sequences foragiven proteolytic enzyme.ageneticallyen9-neered variant of subfiiisin 
et al.. Proteins: Stmcture, funcifon and genetics 6:240-248 (1989)). This variant (hereafter referred to as 
A64SAL subBlisin) containethe following mufaUons: Ser24Cys. His64Ala, Glu156Ser, Gly169Ala and 
Tyr217l^o. Since this enzyme lacks the essential catalytic residue His64. its substrate specificity is greatly 

35 restricted so that certain histidine^ntainlng substrates are preferentially hyrdrolysed (Carter et at.. Science 
237:394-399 (1987)). 
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Constructjon of a hGhVsuhstrate-Dhaoe vector 

The sequence of the linker region in pS0132 was mutated to create a substrate sequence for A64SAL 
subtilisin, using the oligonucleotide 5'-TTC-GGG-CCC-TTC-GCT-GCT-CAC-TAT-ACG-CGT-CAG-TCG- 
ACT-GAC-CTG-CCT-3'. This resulted in the introduction of the protein sequence Phe-Gly-Pro-Phe-Ala-Ala- 
5 His-Tyr-Thr-Arg-Gln-Ser-Thr-Asp in the linker region between hGH and the carboxy tenninal domain of gene III, 
where the first Phe residue in the above sequence is Phe191 of hGH. The sequence Ala-Ala-His-Tyr-Thr-Agr-Gtn 
is known to be a good substrate for A64SAL subtilisin (Carter et al (1989), supra). The resulting pfasmid was 
designated pS0640. 

10 Selective enrichment of hGH-substratMtfiage 

Phagemid particles derived from pS0l32 and pS064D were constructed as described 
in Example I. In initial experiments, a 10^ aliquot of each phage pool was separately mixed with 
30M.1 of oxirane beads (prepared as described in Example II) in lOOjil of buffer comprising 
20mM Tris-HCI pH 8.6 and 2.5M NaCI. The binding and washing steps were performed as 

1 5 described in example VII. The beads were then resuspended in 400^1 of the same buffer, with 
or without 50nM of A64SAL subtilisin. Following incubation for 10 minutes, the supematants 
were collected and the phage titres (cfu) measured. Table XVII shows that approximately 10 
times more substrate-containing phagemid particles (pS0640) were eluted in the presence of 
enzyme than in the absence of enzyme, or than in the case of the non-substrate phagemids 

20 (pS0l32) In the presence or absence of enzyme. Increasing the enzyme, phagemid or bead 
concentrations did not improve this ratio. 

Improvement of the selective enrichment procedure 

in an attempt to decrease the non-specific elution of Immobilised phagemids, a tight- 

25 binding variant of hGH was introduced in place of the wild-type hGIH gene in pS0132 and 

pS0640. The hGH variant used was as described In example XI (pH0650bd) and contains the 
mutations PhelOAIa, Met14Trp, HislSAsp, Hls21Asn, Arg167Asn, Asp171Ser, Glu174Ser, 
Phe176Tyr and llel79Thr. This resulted in the construction of two new phagemids: pDM0390 
(containing tight-binding hGH and no substrate sequence) and pDM041 1 (containing tight- 

30 k>inding hGH and the substrate sequence Ala-Ala-His-Tyr-Thr-Agr-GIn). The binding washing 
and elution protocol was also changed as follows: 

(i) Binding: COSTAR 12-weil tissue culture plates were coated for 1 6 hours with 
0.5ml/weil 2ug/ml hGHbp in sodium cariDonate buffer pH 10.0. The plates were then incubated 
with 1 ml/well of blocking buffer (phosphate buffered saline (PBS) containing 0.1%w/v bovine 

35 serum albumen) for 2 hours and washed in an assay buffer containing 1 0mf^ Tris-HCI pH 7.5, 
TmM EDTA and lOOmM NaCI. Phagemids were again prepared as described in Example I: the 
phage pool was diluted 1 :4 in the above assay buffer and 0.5ml of phage incubated per well for 
2 hours. 
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(ii) Washing: The plates were washed thoroughly with PBS + 0.05% Tween 20 
and incubated for 30 minuted with 1ml of this wash buffer. This washing step was repeated 
three times. 

(ii) Bjtlon: The plates were Incubated for 1 0 minutes in an elutlon buffer 
5 consisting of 20mM Tris-HCI pH 8.6 + 1 0OmM NaCI, then the phage were eluted with 0.5ml of 
the above buffer with or without 500nM of A64SAL subtilisin. 

Table XVII shows that there was a dramatic increase in the ratio of specifically eluted 
substrate-phagemid particles compared to the method previously described for pS0640 and 
pS0132. It is likely that this is due to the fact that the tight-binding hGH mutant has a 
1 0 significantly slower off-rate for binding to hGH binding protein compared to wild-type hGH. 

Table XVU 

Specific eiution of substrate-phagemids by A64SAL subtilisin 

15 

Colony forming units (cfu) were estimated by plating out 10^1 of 10-fold dilutions of phage on 
10)il spots of XL-1 blue cells, on LB agar plates containing 50|ig/ml carbenidllinl 

0 Wild-type hGH gene: binding to hGHbp-oxirane beads 

phaoemid 4- mnM A64SAL RQ enZYme 

PS0640 (substrate) 9x1 O^cf u/1 0^1 1 .5x1 06cfu/1 O^ti 

PS0132 (non-substrate) 6x1 0%u/l O^il SxlO^cfu/lOjil 

(ii) pH0650bd mutant hGH gene: binding to hGHbp-coated plates 

phagemid 5QnM A64SAL W enZYmg 

PDM0411 (substrate) 1.7x105cfu/10nf 2x103cfu/10nl 

PDM0390 (non-substrate) 2x1 O^cfu/I O^il 1x1 O^cfu/I Oyii 

35 Example XiV 

identification of preferred substrates for A64SAL subtilisin using selective 
enrichment of a library of substrate sequences. 

40 We sought to emptoy the selective enrichment procedure described in Example XIII to 

identify good substrate sequences from a library of random substrate sequences. 

rtnngfninfion of a yfifitorfor ingfirtinn of randomised substrate cassetteS 

We designed a vector suitable for introduction of randomised substrate cassettes, and 
45 subsequent expression of a library of substrate sequences. The starting point was the vector 
pS0643, described in Example Vlir. Site-directed mutagenesis was carried out using the 
ongonudeotide 5'-AGC-TGT-GGC-TTC-QS£H2£C-GCC^CC-GC£^^ 
TCT-3', which introduces Acal (GGGCCC) and Sail (GTGGAC) restriction sites between hGH 
and Gene ML This new constaict was designated pDM0253 (The actual sequence of pDM0253 
50 is S^AGC-TGT-GGC-TTC-GGG-CCC-GCC-CCC-GCG-TCG-ACT-GGC^^ where 



20 



25 



30 



wo 92/09690 



57 



PCr/US91/09133 



the underlined base substitution is due to a spurious error in the mutagenic oligonucleotide). 
In addition, the tight-binding hGH variant described in example was Introduced by exchanging 
a fragment from pDM041 1 (example XIII) The resulting library vector was designated 
PDM0454. 

5 

Preparation of the library cassette vector and insertion of the mutagenic 

cassette 

To introduce a library cassette, pDM0454 was digested with Apal foilowed by Sail, then 
precipitated with 13% PEG 8000+ 10mM MgCl2. washed twice in 70% ethanoi and 

1 0 resuspended This efficiently precipitates the vector but leaves the small Apa-Sal fragment in 
solution (Paithankar, K. R. and Prasad, K. S. N., Nucleic Acids Research 19:1346). The 
product was run on a 1% agarose gel and the Apal-Sail digested vector excised, purified using 
a Bandprep kit (Pharmacia) and resuspended for ligation with the mutagenic cassette. 

The cassette to be Inserted contained a DNA sequence similar to that in the linker 

1 5 region of pS0640 and pDM041 1 , txjt with the codons for the histidlne and tyrosine residues in 
the substrate sequence replaced by randomised codons. We chose to substitute islNS 
(N=::G/A/T/C; S=G/C) at each of the randomised positions as described in example VIII. The 
oligonucleotides used in the mutagenic cassettes were: 5 -C-TTC-GCT-GCT-NNS-NNS-ACC- 
CGG-CAA-3' (coding strand) and 5'-T-CGA-TTG-CCG-GGT-SNN-SNN-AGC-AGC-GAA-GGG- 

20 CC-3* (non-coding strand). This cassette also destroys the Sail site, so that digestion with Sail 
may t>e used to reduce the vector background. The oligonucleotides were not 
phosphorylated before Insertbn into the Apa-Sal cassette site, as it was feared that 
subsequent oligomerisation of a small population of the cassettes may lead to spurious results 
with multiple cassette inserts. Foltowing annealing and ligation, the reaction products were 

25 phenokchlorofonn extracted, ethanoi predpitated and resuspended in water. Initially, no 

digestion with Sail to reduce the background vector was performed. Approximately 200ng was 
electroporated into XL-1 blue cells and a phagemid library was prepared as described in 
example VIII. 

30 Selection of highly cleavable substrates from the substrate library 

The selection procedure used was identical to that described for pDM041 1 and 
pDM0390 in example XIII. After each round of selection, the eluted phage were propagated by 
transducing a fresh culture of XL-1 blue cells and propagating a new phagemid library as 
described for hGH-phage in example VIII. The progress of the selection procedure was 
35 monitored by measuring eluted phage titres and by sequencing individual clones after each 
round of selection. 

Table A shows the successive phage titres for elution in the presence and absence of 
enzyme after 1 , 2 and 3 rounds of selection. 
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Clearly, the rafio of specifically eluted phage: non-speciflcally eluted phage (le phage eiuted 
with enzymephage eluted vwlthout enzyme) increases diamatically from round 1 to round 3, 
suggesting that the population of good substrates is increasing with each round of selection. 
Sequencing of lO isolates from the starting library showed them all to consist of the 
5 wild-^ pDM0464 sequence. This is attributed to the fact that after digestion with Apal. the 
Sail site is very close to the end of the DNA fragment, thus leading to low efficiency of 
digestion. Nevertheless, there are only 400 possible sequences in the library, so this 
population should still be well represented. 

Tables B1 and B2 shows the sequences of isolates obtained after round 2 and round 

10 3 of selection. After 2 rounds of selection, there is clearly a high incidence of histidine 
residues. This is exactly what is expected: as descrlbed in example XIII, A64SAL subtilsin 
requires a histidine residue in the substrate as it employs a substrate-assisted catalyfic 
mechanism. After 3 rounds of selection, each of the 1 0 clones sequenced has a histidine in 
the randomised cassette. Note, however, that 2 of the sequences are of pDM041 1 , which was 

15 not present in the starting library and is therefore a contaminant. 
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Table A 

Titration of initial phage pools and eluted phage from 3 rounds of selective 

enricliment 



10 



15 



20 



25 



30 



35 



Colony forming units (cfu) were estimated by plating out 10)xl of 10-fold dilutions of phage on 
10|xl spots of XL-1 blue cells, on LB agar plates containing 50(ig/mi carbenicillin 

ROUND 1 



Starting library: 

UBRARY: 

PDM0411: 
(control) 

ROUND 2 
Round 1 library: 

LIBRARY: 

PDM0411: 
(control) 

ROUNDS 

Round 2 library: 

LIBRARY: 



PDM0411: 
(control) 



3x1 0l 2 cfu/ml 

4500nM A64SAL 
no enzyme 

•i^OOnM A64SAL 
no enzyme 



7x10^2 cfu/ml 

■h500nM A64SAL 
no enzyme 

•i^OOnM A64SAL 
no enzyme 



7x10^1 clu/ml 

+500nM A64SAL 
no enzyme 

■i-500nM A64SAL 
no enzyme 



4x1 03 cfu/IOnl 
3x1 03 cfu/10^1 

2x106cfu/10nl 
8x1 0^ cfu/IOfil 



3x10"^ Cfu/IOnl 
6x1 03 cfu/10^1 

3x1 06 cfu/IOjii 
1.6x104 cfu/10|il 



1x1 05 cfu/10^1 
<103 cfu/IOnl 

5x106cfu/10nl 
3x1 04 cfu/IO^ll 
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Table B1 

Sequences of eiuted phage after 2 rounds of selective enrichment. 

5 AH protein sequences should be of the tonn AA**TRQ, where * represents a 

randomised codon. In the table below, the randomised codons and amino acids are underlined 
and in bold. 



10 



15 



30 



35 



After round 2: 



Sequence NQi Of 

occurrences 



A A fl 2 T R Q 

GCT GOT r'ar toc ACC CGG CAA 

A A fl ii T R Q 

20 ... GCT GCT ffRg ATG ACC CGG CAA 

A A It a T R Q 

. . . GCT GCT rjg ACC CGG CAA 

25 A A L fi T R Q 

. . . GCT GCT rjPfi cue ACC CGG CAA 



A A fi 1 R Q 

GCT GCT CAC ACC CGG CAA ... 1 # 

A A 2. a T R Q 
GCT GCT ACC CGG CAA 1 ## 

wild-type pDM0454 3 



# - spurious deletion of 1 codon within the cassette 
## - amt^guous sequence 
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Sequences of eluted nhaae after 3 rounds of selective enrichment. 

5 All protein sequences should be of the form AA**TRQ. where * represents a 

randomised codon. In the table below, the randomteed codons and amino adds are underlined 
and in bold. 



After round 3: 

10 



1 u 






Sequence 






























occurrences 








* 


* 














A 


A 






rn 

± 




\e 






15 


, . . GCT 


GCT 




•PAT 


ACG 


CGT 


CAG . . . 




2 # 




A 


A 


T 

M 


a 


T 


R 


Q 








. . . GCT 


GCT 






ACC 


CGG 


CAA . . . 




2 


20 


A 


A 


Q. 


a 


T 


R 


Q 








. . . GCT 


GCT 


C2i&.. 


c&c 


ACC 


CGG 


CAA • • • 




1 




A 


A 


Z 


a 


T 


R 


Q 








. . . GCT 


GCT 


hQQ 




ACC 


CGG 






1 


25 






















A 


A 


a 


a 


R 


Q 










. . . GCT 


GCT 






CGG 


CAA 




1 






A 


A 


fi 


a 


T 


R 


Q 






30 


. , , GCT 


GCT 






ACC 


CGG 


CAA 


1 


## 




A 


A 


a 




R 


Q 










, . . GCT 


GCT 






CGG 


CAA 




1 




35 


A 


A 


a 




R 


Q 










. . GCT 


GCT 






CGG 


CAA 




1 





# - contaminating sequence from pDM0411 

## - contains the "illegal" codon CAT - T should not appear in the 3rd position of a 
40 codon. 



45 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 
<x) APPLICANT 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Genentechr Inc. 
Garraxd, Lisa J. 
Henner, Dennis J. 
Bass, Steven 
Greene , Ronald 
Lowman, Henry B. 
Wells r James A* 
Matthews, David J, 

(11) TITLE OF INVENTION: Enrichment Method For Variant Proteins With 
Altered Binding Properties 

(ill) NUMBER OF SEQUENCES: 27 

(Iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D ) STATE : California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 5.25 Inch^ 360 Kb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE; patln (Genentech) 

(vl) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 03-DEC-91 

(C) CLASSIFICATIONS 

(vli) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/743614 

(B) APPLICATION DATE: 09-Aug-91 

(vli) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/715300 

(B) APPLICATION DATE: 14-June-91 

(vli) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/683400 

(B) APPLICATION DATE: lO-Apr-91 

(vli) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/621667 

(B) APPLICATION DATEz 03-Dec-1990 

(vlii) ATTORNEY/AGENT iNFORldATIONs 

(A) NAME; Benson, Robert H. 

(B) REGISTRATION NUMBER: 30,446 

(C) REFERENCE/DOCKET NUMBER: 645P4 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/266-1489 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 

(2) INFORMATION FOR SEQ ID NO:l: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOsl; 

6GCAGCTGT6 GCTTCTAGAG TGGCGGCGGC TCTGGT 36 



10 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AGCTGTGGCT TCGGGCCCTT AGCATTTAAT 6CGGTA 36 
(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

( B ) TYPE : nucleic acid 
<C) STRANDEDNESS; single 

30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
35 TTCACAAAC6 AAGGGCCCCT AATTAAA6CC AGA 33 

(2) INFORMATION FOR SEQ ID NO: 4: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CAATAATAAC GGGCTAGCCA AAAGAACTGG 30 

50 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CACGACAGAA TTCCCGACTG GAAA 24 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPES nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGX: linear 

^ (xi) SEQUENCE DESCRIPTIONS SEQ ID NOs6t 

CTGTTTCXAG AGTGAAATTG TTA 23 

10 

(2) INFORMATION FOR SEQ ID N0s7s 

(i) SEQUENCE CHARACTERISTICS s 
15 (A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ACATTCCTGG GTACCGTGCA 6 21 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 bases 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

35 

GCTTCAG6AA GGACATGGAC NNSGTCNNSA CANNSCT6NN SATCGTGCAG 50 



40 TGCC6CTCTG TGG 63 



(2) INFORMATION FOR SEQ ID NO:9: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AAGGTCTCCA CATACCTGAG GATC 24 

55 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGACAAGG TGTCGACATA CCTGCGCATC GTG 33 
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10 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHl^RACTERISTICS : 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STHANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



15 



30 



50 



G6CAGCT6TG GCTTCTAGAG TGGCGGCGGC TCTGGT 36 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGCAGCTGTG GATTCTAGAG TGGCGGTGGC TCTGGT 36 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

40 Gly Ser Cys Gly Phe Glu Ser Gly Gly Gly Ser Gly 
1 5 10 12 

(2) INFORMATION FOR SEQ ID NO: 14: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



55 



C6GACTGG6C AGATATTCAA 6CAGACC 27 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CTCAAGAACT ACGGGTTACC CTGACTGCTT CAGGAAGG 38 
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10 



(2) INFORMATION FOR SEQ ID NOrl6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHt 30 bases 

(B) TYPEr nucleic acid 

(C) STRANDEDNESSs Single 

(D) TOPOLOCri linear 

(xi) SEQUENCE DESCRIPTIONi SEQ ID NO: 16: 



CGCATCGTGC AGTGCAGATC TGTGGAGGGC 30 

15 

(2) INFORMATION FOR SEQ ID NOslTi 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 66 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GTTACTCTAC TGCTTTCAGG AAGGACATGG ACNNSGTCNN SACANNSCTG 50 

30 

NNSATCGT6C AGT6CA 66 



(2) INFORMATION FOR SEQ ID NO: 18: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



45 GATCTGCACT GCACGATSNN CAGSNNTGTS NNGACSNNGT CCATGTCCTT 50 



CCTGAAGCAG TA6A 64 

50 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

60 

GCCTTTGACA GGTACCAGGA GTTTG 25 



65 (2) INFORMATION FOR SEQ ID NO:20: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 bases 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLCXJY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



CCAACTATAC CACTCTCGAG GTCTATTCGA TAA 33 

10 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 66 bases 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

20 

TCGAGGCTCN NSGACAACGC GNNSCTGCGT GCTNNSCGTC TTNNSCAGCT 50 



25 GGCCTTTGAC ACGTAC 66 



(2) INFORMATION FOR SEQ ID NO: 22: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GTGTCAAAGG CCAGCTGSNN AAGACGSNNA GCACGCA6SN NCGCGTTGTC 50 

40 

SNNGAGCC 58 



45 (2} INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 bases 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

55 

GTTACTCTAC TGCTTCNNSA AGGACATGNN SAA6GTCAGC NNSTACCTGC 50 



GCNNSGTGCA GTGCA 65 

60 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 64 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



68 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GATCTCCACT GCACSNNGCG CAGGTASNNG CTGACCTTSN NCATGTCCTT 50 
SNNGAAGCAG TAGA 64 

(2) INFORMATION FOR SEQ ID NO: 25; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2178 bases 

(B) TYPE: nucleic acid 

( C ) STRANDEDNE S S : S ingle 

(D) TOPOLOGY: linear 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTAT6TTCG TTTTTTCTAT 50 
TGCTACAAAC GCGTACGCTG ATATCCAGAT GACGCAGTCC CCGAGCTCCC 100 
TGTCC6CCTC TGTGGGCGAT A6GGTCACCA TCACCTGCCG TGCCAGTCAG 150 
GATGTGAATA CTGCtGTAGC CTGGTATCAA CAGAAACCA6 GAAAAGCTCC 200 
GAAACTACTG ATTTACTCGG CATCCTTCCT CTACTCTGGA GTCCCTTCTC 250 
GCTTCTCTGG ATCCAGATCT 6G6ACG6ATT TCACTCTGAC CATCAGCAGT 300 
CTGCAGCCGG AAGACTTCGC AACTTATTAC TGTCAGCAAC ATTATACTAC 350 
TCCTCCCAGG TTC66ACAGG GTACCAAGGT GGAGATCAAA CGAACTGTGG 400 
CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 
GGAACT6CCT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAG6C 500 
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 
AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCA6C 600 
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 
CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 
GGGGAGAGTG TTAAGCTGAT CCTCTACGCC GGACGCATCG TGGCCCTAGT 750 
ACGCAAGTTC ACGTAAAAAG GGTATCTAGA GGTTGAGGTG ATTTTATGAA 800 



69 

AAAGAATATC GCATTTCTTC TTGCATCTAT 
CAAACGCGTA CGCTGAGGTT CAGCTGGTGG 
CAGCCAGGGG GCTCACTCCG TTTGTCCTGT 
TAAAGACACC TATATACACT 6GGTGCGTCA 
AATGGGTTGC AAGGATTTAT CCTACGAATG 
AGCGTCAAGG GCCGTTTCAC TATAAGCGCA 
CTACCTGCAG ATGAACAGCC TGCGTGCTGA 
GTTCTAGATG GGGAGGGGAC GGCTTCTATG 
GGAACCCTGG TCACCGTCTC CTCGGCCTCC 
CCCCCTGGCA CCCTCCTCCA AGAGCACCTC 
GCTGCCTGGT CAAGGACTAC TTCCCCGAAC 
TCAGGCGCCC TGACCAGCGG CGTGCACACC 
CTCAGGACTC TACTCCCTCA GCAGCGTG6T 
TGGGCACCCA GACCTACATC TGCAACGTGA 
AAGGTGGACA AGAAAGTTGA GCCCAAATCT 
GCCCTTCGTT TGTGAATATC AAGGCCAATC 
CTGTCAATGC TGGCGGCGGC TCTGGTGGTG 
GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG 
CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA 
CAAACGCTAA TAAGGGGGCT ATGACCGAAA 
CAGTCTGACG CTAAAGGCAA ACTTGATTCT 
TGCTATCGAT GGTTTCATTG GTGACGTTTC 
GTGCTACTGG TGATTTTGCT GGCTCTAATT 



GTTCGTTTTT TCTATTGCTA 850 
AGTCTGGCGG TGGCCTGGTG 900 
GCAGCTTCTG GCTTCAACAT 950 
GGCCCCGGGT AAGGGCCTGG 1000 
GTTATACTAG ATATGCCGAT 1050 
GACACATCCA AAAACACAGC 1100 
GGACACTGCC GTCTATTATT 1150 
CTATGGACTA CTGGGGTCAA 1200 
ACCAAGGGCC CATCGGTCTT 1250 
TGGGGGCACA GC6GCCCTGG 1300 
CGGTGACGGT GTCGTGGAAC 1350 
TTCCCGGCTG TCCTACAGTC 1400 
6ACTGTGCCC TCTAGCAGCT 1450 
ATCACAAGCC CAGCAACACC 1500 
TGTGACAAAA CTCACACAGG 1550 
GTCTGACCTG CCTCAACCTC 1600 
GTTCTGGTGG CGGCTCTGAG 1650 
GGTGGCGGCT CTGAGGGAGG 1700 
TTTTGATTAT GAAAAGATGG 1750 
ATGCCGATGA AAACGCGCTA 1800 
GTCGCTACTG ATTACGGTGC 1850 
CGGCCTTGCT AATGGTAATG 1900 
CCCAAATGGC TCAAGTCGGT 1950 
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GACGGTGATA ATTCACXZTTT AATGAATAAT TTCCGTCAAT ATTTACCTTC 2000 
CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTAGC GCTGGTAAAC 2050 

5 

CATATGAATT TTCTATTGAT TGTGACAAAA TAAACTTATT CC6TGGTGTC 21O0 
TTTGCGTTTC TTTTATATGT TGCCACCTTT ATGTAT6TAT MTCTACGTT 2150 
TGCTAACATA CTGCGTAATA A6GAGTCT 2178 



15 



30 



45 



60 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 237 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

25 Met LvB Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 



Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 
20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 
35 40 45 



Cys Arg Ala Ser Gin Asp Val Asn Thr Ala Val Ala Trp Tyr Gin 
35 50 55 60 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ser Ala Ser 
65 70 75 

40 Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Arg Ser 

80 85 90 



Glv Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Giu Asp 
95 100 105 

Phe Ala Thr Tyr Tyr Cys Gin Gin His Tyr Thr Thr Pro Pro Thr 
110 115 120 



Phe Gly Gin Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala 
50 125 130 135 

Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 
140 145 150 

55 Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 

155 160 165 



Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 
170 175 180 

Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
185 190 195 



Tyr ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
65 200 205 210 

Lvs His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 
215 220 225 
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Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
230 235 237 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 



Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
20 35 40 45 

Ala Ala Ser Gly Phe Asn He Lys Asp Thr Tyr He His Trp Val 
50 55 60 

25 Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg He Tyr 

65 70 75 



Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val Lys Gly Arg 

80 85 90 

Phe Thr He Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr Leu Gin 

95 100 105 



Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ser 
35 110 115 120 

Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gin 
125 130 135 

40 Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser 

140 145 150 



Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 
155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
170 175 180 



Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 
50 185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 

200 205 210 

55 Val Val Thr Val Pro ser Ser ser Leu Gly Thr Gin Thr Tyr He 

215 220 225 



Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 

230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Gly Pro Phe Val 

245 250 255 



Cys Glu Tyr Gin Gly Gin Ser Ser Asp Leu Pro Gin Pro Pro Val 
65 260 265 270 

Asn Ala Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Glu 
275 280 285 
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Glv Gly Gly Ser Glu Gly GLy Gly Ser Glu Gly Gly Gly Ser Glu 
^ 290 295 300 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr 
5 305 310 315 

Glu Lys Met Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala 
320 325 330 

10 Asp Glu Asn Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser 

335 340 345 



15 



30 



Val Ala Tlir Asp Xyr Gly Ala Ala lie Asp Gly Phe lie Gly Asp 
350 355 360 

Val Ser Gly Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala 
365 370 375 



Gly Ser Asn Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser 
20 380 385 390 

Pro Leu Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin 
395 400 405 

25 Ser Val Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr 

410 415 420 



Glu Phe Ser He Asp Cys Asp Lys He Asn Leu Phe Arg Gly Val 
425 430 435 

Phe Ala Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser 
440 445 450 



Thr Phe Ala Asn He Leu Arg Asn Lys Glu Ser 
35 455 460 461 
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What is claimed is: 

1. A method for selecting novel binding polypeptides comprising: 

5 (a) constructing a replicable expression vector comprising 

a transcription regulatory element operai)ly linked to 
a gene fusion encoding a fusion protein wherein the gm 
a first gene encoding a polypeptide, and 
a second gene encoding at least a portion of a 
1 0 phage coat protein; 

( b ] mutating the vector at one or more selected positions within the first gene thereby fomiing a 
family of related plasmids; 

(c) transfomnlng suitable host cells with the plasmids; 

15 

( d ) infecting the transfomned host cells with a helper phage 

having a gene encoding the phage coat protein; 

(e) cuituring the transformed infected host ceils under conditions 

20 suitable for fomiing recombinant phagemid particles containing at least a portion of 

the piasmid and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagemid particles display more than one copy of the 
fusion protein on the surface of the particle; 

25 ( f ) contacting the phagemid particles with a target molecule so that at least a portion of the 

phagemid particles bind to the target molecule; and 

( g ) separating the phagemid particles that bind from those that do not 

30 2. The method of claim 1 further comprising infecting a suitable host ceils with the phagemid particles that 
bind and repeating steps (d) through (g). 

3. The method of claim 2 wherein the steps are repeated one or more times. 

35 4 . The method of claim 1 wherein the expression vector further comprises a secretory signal sequence. 



5. The method of claim 1 wherein the transcription regulatory etement is a promoter system selected 
from the group; lacZ, pho A, tryptophan, tac, b bacteriophage T7, and combinations 
thereof. 
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6. Themethod(rfcamlvi*i«eftiftefirstgeneetx»desamammalianpro^^ 

7. The method of daftn 6 wherein the protein is selected from the group; 

growth homione, human ^wth homiore{hGH), des-N-methionyi human growth homione. 
bovnne growth hormone, parathyroid homione, thyroxine, insulin A-chain. insulin B-chaln. 
proinsulin. relaxin A-chain, relaxin B-chaln, prorelaxin, fbllide stimulating hormone(FSH), 
thyroid stimulating honnone(TSH), leutinizing homione(LH). glycoprotein hormone receptors, 
calcitonin, glucagon, factor VIII. an antibody, lung surfactant, urokinase, slreptoianase, human 
tissue-type plasminogen acBvalor (t-PA), l)ombesin. factor IX, thrombin, hemopoietic growth 
factor, tumor necrosis factor-alpha and -beta, enkephalinase. human senm albumin, raullerian- 
InhWtlig substance, mouse gonadotiopln^ssocialed peptide, p-bctamase, tissue factor 
protein, inhlbin. activin, >ascular endothelial growth factor, inlegrin receptors, thrombopoietin, 
protein A or D, rheumatoid factors, NGF-p. platelet-growth factor, tiansfonning growth 
factor ; TGF-alpha and TGF-beta, insulin-ilte growth fador-l and -II, insolin-lilte growth 
factor b&xfing proteins . CEM, DMase, btency associated peptide, erythropoietin. HER2 
ligands. osleoinductiwe factors, interferon-alpha, -beta, and -gamma, colony stimulating 
factors (CSFs), M-CSF, 6M-CSF, and G-CSF, interieukins (11^). IL-1, IL-2. IL-3. IL-4. 
superoxkle dismutase; decay accelerattng factor, viral anfigen. HIV envelope proteins GP120 
and GP140. atrial natriuretic peptides A, B, or C, or immune globulins, and fragments of the 
above-listed proteins. 



8. The method of claim 7 wherein the protein is a human protein. 
25 9. The method of claim 8 wherein the protein comprises more than about 100 amino acid residues. 

10. The method of claim 1 wherein the protein comprises a plurali^ of rigid secondary strndures dfepbying 

amno adds capable of interacting wift the target, and the mutetions are primarily produced 
at positions conespondng to codons encoding the amlnoadds. 

30 

11. The method of daim 10 wherein the rigid secondary structures comprise structures selected from the 

group; a-(3.6i3)helix. 3io helix. «-(4.4i6)helix, parallel and anU-paraliel p-pleated sheets, 
reverse tums, and non-ordered stmctures. 

35 12. ThemethodofdaimlOwhereinthemufationsareproducedatmorethanonecodon. 

13. The method of claim 12 wherein Ihe mutations are produced on more than one rigid secondary structure. 
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10 



15 



14. The method of daim 1 wherein the helper phage is selected from the group M13K07, M13R408, M13- 

VCS,andPhiX174. 

15. The method of claim 14 wherein the helper phage is M13K07 and the coat protein is the M13 phage gene 

III coat protein. 

16. The method of claim 15 wherein the ho^ is £ co//. 

17. The method of claim 16 wherein the plasmid Is under tight control of the transcription regulatory 

element 

18. Tlie method of daim 17 wherein the amount is less than at)out 1%. 

19. The method of claim 18 wherein the amount is less than 20% the amount of phagemid particles displaying 

a single copy of the fusion protein. 

20. The method of daim 19 wherein the amount is less than 10%. 



21. The method of daim 1 further comprising in step (a), inserting a DNA triplet, encoding an mRNA 

suppressible temfiinator codon between said first gene encoding a polypeptide, and said 
20 second gene encoding at least a portion of a phage coat protein. 

22. The method of daim 21 wherein said mRNA suppressible temiinator codon is selected from ttie 

following: UAG (amber), UAA (ocher) and UGA (opel). 

25 23. The method of daim 22 wherein said suppressUe mutation results In the detectable producb'on of a 
fusion polypeptide containing sadi polypeptide and said coat protein when said expression 
vector is grown in a suppressor host cell; and, when grown in a non-suppressor host cell said 
polypeptide is synthesized substantially without fusion to said phage coat protein. 

30 24. A human growth hormone variant wherein hGH amino acids 1 72, 1 74, 1 76 and 1 78 respectively are as a 
group sequentially selected from one of the following: (1)R,S,F,R; (2)R,A,Y,R; {3)K,T,Y,K; 
(4)R,S.Y,R: (5)K,A.Y,R; (6)R,F.F,R; (7)K,Q.Y.R; (8) R,T,Y.H; (9)Q,R,Y,R; {10)K,K.Y,K; 
(11)R.S.F,S;and{12)K.S,N,R. 

35 25. A phagemid comprising a repiicabie expression vector comprising a transcription regulatory element 
operably linked to a gene fusion encoding a fusion protein wherein the gene fusion comprises a 
first gene encoding a polypeptide, and a second gene encoding at least a portion of a phage 
coat protein, wherein a DNA triplet codon encoding an mRNA suppressible terminator codon 
selected from UAG, UAA and UGA is inserted between the fused ends of the first and second 
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genes, or is substitifled for an amino add encoding triplet codon acSacent to gene fusion 
juncBon. 

26. The phagemid of daim 25 wfierein saidfirst gene encodes a mammalian protein. 

5 

27. The phagemid of dam 26 wherein the protein is selected from the group: growvth hormone, human 

growth homione (hGH), des-N-melhionyi human growth homione, bovine growth homione, 
parathyroid hormone, thyroxine, insulin A-chain. insulin B-chain. proinsulin, reiaxin A-chain, 
relaxin B-chain, proreiaxin, follicle stimulating homione (FSH), thyroid stimulating hormone 

1 0 (TSH), leutinizing hormone (LH), glycoprotein hormone receptors, cata'tonin, glucagon, factor 

vm, an antibody, lung surfactant, urokinase, streptokinase, human tissue-type plasminogen 
aciivator (t-PA), bombesin, factor IX, thrombin, hemopoietic growth factor, tumor necrosis 
factor-aipha and-beta, enkephalinase, human serum albumin, mulleriarr-lnhibiling substance, 
mouse gonadotropin-assodafed peptide, p-iactamase. tissue factor protein, inWbin, activin, 

1 5 vascular endothelial growth factor, integrin receptors, thrombopoietin, protein A or D. 

rheumatoid factors. NGF-p, plateiet-growlh factor, transforming growth factor; TGF-afpha 
and TGF-bela, insulin-like growth-! and -li, insuSn-like growth factor binding proteins. CE)-4, 
DNase, latency associated peptide, erythropoietin, osteoinductive factors, interferon-alpha, - 
beta, and -gamma, colony stimulating factors (CSFs). M-CSF, GM-CSF. and G-CSF, 

20 interleukins (ILs), IL-1, IL-2. lL-3, 1L4. superoxide dismutase; decay accelerating factor, viral 

antigen, HIV envetope proteins GP120 and GP140. atrial natriuretic peptides A, B or C 
Immune globulins, and fragments of the above-listed proteins. 

28. The phagemid of daim 27 wherein said protein Is a human protein. 

25 

29 . The phagemid of daim 28 wherein the protein comprises more than about 100 amino add resMues. 

30. The phagemM of dahi 25 wherein saM protein comprises a plurality of rigM secondary structures 

displaying amino adds capable of interacting with the facget 

30 

31. The phagemid of dafm 30 wherein saM rigW secondary structures comprises structures selected from 

the group; a-(3.6i3)helix, 3io helix. jr-(4.4i6)helix. parallel and antii5aralle! p-pleated 
sheets, reverse turns, and non-ordered structures. 

35 32. The phagemid of daim 25 wherein the helper phage is selected from the group M13K07, M13R408. 
M13-VCS,andPhiX174. 



33. 



The phagemid of claim 32 wherein the helper phage is M13K07 and the coat protein is the M13 phage 
gene 111 coat protein. 
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34. The phagemid of claim 33 wherein the host is the E co// wild type or suppressor type. 

35. The phagemid of daim 34 wherein the plasmid is under tight control of the transcription regulatory 
5 element 

36. The phagemid of claim 35 wherein the number of phagemid particles displaying more than one copy of 

the fusion protein on the surface of the particles is less than 1%. 

1 0 37. The phagemid of claim 36 wherein said number of phagemid particles is less than about 10%. 

38. The phagemid of claim 37 wherein the number of phagemid particles is less than about 20%. 

39. A human growth variant wherein hGH amino adds 10, 14, 18, and 21 respectively are as a group 
1 5 sequentially selected from one of the following: 

(1)H.G,N,N; (2)A,W,D,N; (3)F.S.F,L; {4)Y,T,V,N and (5)I,N,I.N. 

40 . A human growth variant wherein hGH amino acids 1 74 is serine and 1 76 is tyrosine and hGH amino adds 

167, 171 , 175 and 179 respectively are as a group sequentially selected from one of the 
20 following: 

(1)N,S JJ; {2)E,SJ,l; (3)K.SJ.L; (4)N,N,T.T; (5) R,D,I,I; and (6)N,S,T,Q. 

41 . A method for selecting novel binding polypeptides comprising 

(a) constmcdng a replicable expression vector comprising a 

25 transcription regulatory element operably linked to DNA encoding a protein of 

interest containing one or more subunits, wherein the DNA encoding at least one of 
the subunits is fused to the DNA encoding at least a portion of a phage coat protein; 

( b ) mutating the DNA encoding the protein of interest at one or more selected positions thereby 
30 forming a family of related vectors; 

( c ) transforming suitable host cells with the vectors; 

( d ) infecting the transformed host celts with a helper phage having a gene encoding the phage 
35 coat protein; 

(e) culturing the transformed Infected host cells under conditions suitable for forming 

recombinant phagemid particles containing at least a portion of the plasmid and 
capable of transforming the host, the conditions adjusted so that no more than a 
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minor amount of phageraid partides display more than one copy of the fusion protein 
on the surface of the particTe; 

( f ) contacting the phagemid particles with a target molecule so that at least a portion of the 
5 phagemid particles bind to the target molecule; and 

(g ) separating the phagemid particles that bind from those that do not 

42. The method of claim 41 wherein the expression vector further comprises a secretory signal sequence 
^ 0 operat)Iy linked to the DNA encoding each sulaunit of the protein of interest 

43 . The method of claim 42 wherein the protein of interest is a mammal'an protein. 

44 . The method of claim 43 wherein the protein of interest is selected from the group; 

^ 5 insulin, relaxin, follide stimulating homione (FSH), thyroid stimulafing hormone (TSH), 

leufinizing hormone (LH), glycoprotein homione receptors, monoclonal and polyclonal 
antibodies, lung surfactant, integrin receptors, hsuMke growth l&ctor-l and -II, and 
fragments of the above-listed proteins. 

20 45 . The method of daim 44 wherein the protein of interest is a humanized antibody. 

46 . The method of claim 45 wherein the protein of interest is a humanized Fab fragment capable of binding 
to the HER-2 receptor (human epidermal growth factor receptor-2). 

25 47 A human growth honnone (hGHJvartant wherein hGH amino add glutamalei74 is replaced by serinei74 
and phenylalaninei76 is replaced by tyrosinei 76 and one or more of the eight natiffally 
occurring h6H amino adcfe F10, M14, HIS. H21 , R167, D171, T175 and 1179 are replaced by 
another natural amino add. 

30 48 The hGH variant of daim 47 wherein the eightnaturallyoccum'ng hGH amino adds F10. M14, Hia, H21, 
R167, D171 , T175 and 1179 respectively are as a group replaced with a corresponding amino 
add sequentially selected from one of the following groups: 



35 



40 



(1) 


H, 


G, 


N, 


N, 




Sr 


Tr 


T; 


(2) 


Hr 


G, 


Nr 


Nr 


E, 


s. 


T, 


I; 


(3) 


H, 


Gr 


N, 


N, 


N, 


Nr 


T, 


T; 


(4) 


Ar 


w. 


Dr 


N, 


N, 


s. 


T, 


T; 


(5) 


A, 


w. 


D, 


N, 


Er 


Sr 


Tr 


I; 


(6) 


Ar 


w. 


D, 


N, 


N, 


Nr 


Tr 


T; 


(7) 


Fr 


s. 


F, 


Lf 


Nr 


s. 


Tr 


T; 


(8) 


F, 


s. 


F, 
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Er 


Sr 


T, 


1; 


(9) 


F, 


s. 


F, 




Nr 


Nr 


T, 


T. 


(10) 


H, 


G, 


N, 


N, 


N, 


Sr 


Tr 


N; 


(11) 


A, 


N, 


D, 


A, 


Nr 


Nr 


Tr 


N; 


(12) 


F, 


Sr 


F, 


Gr 


H, 


s. 


T, 


T; 


(13) 


H, 


Q, 




s. 


A, 


Dr 


N, 


S; 
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Nr 


N, 


Ar 


T, 


T; 
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F, 


s. 
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Sr 


Dr 


T, 


T; 


(16) 


A, 


Sr 


Tr 


N, 


Rr 


D, 


T, 


I; 


(17) 


Qr 


^, 
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Sr 
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Wr 


Gr 
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Sr 


Rr 


D, 


Tr 
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K, 
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Tr 


V; 


(20) 


w. 


Nr 


N, 


s. 


H, 


s. 


T, 


T; 
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(21) 
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(22) 
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(26) 


F, 


Sr 


Fr 




Tr 


Sr 


Tr 


S; 


(27) 


A, 


w. 


D, 




Rf 


Dr 


Tr 


I; 


(28) 


A, 


Wr 


Dr 


Nr 


Hr 


Sr 


Tr 


N; 


(29) 


M, 


Qr 




N, 


N, 


s. 




T; 


(30) 


H, 


Yr 


Dr 


Hr 


Rr 


D, 


Tr 


T; 


(31) 




N, 


s. 


H, 


Rf 






I; 


(32) 




Nr 


s. 


Hr 


Tr 


Sr 


Tr 


T; 


(33) 


A, 


w. 




N, 




Ar 


T, 


T; 


(34) 




Sr 


Tr 


Gr 


Rr 


Dr 


Tr 


I; 


(35) 


A, 


w. 


D, 


N, 


Rf 


D, 




I; 


(36) 


If 


Qr 


Er 


Hr 


Nr 


Sr 


Tr 


T; 


(37) 


F, 


s. 




A, 


Nr 


s. 


T. 


V; 


(38) 


F, 


Sr 


Fr 




Kr 


Dr 


Tr 


T; 


(39) 


M, 


A, 


D, 


N, 


N. 




T, 


T; 


(40) 


A, 


Wr 


Dr 


Nr 


Sr 


s. 


Vr 


T; 


(41) 


H, 


Qr 




s. 


Rr 




Tr 


I. 





















49. The method of claim 48 wherein said human growth hormone variant (1 1) further contains leudneis replaced 

by argininei5 and lystneies replaced by arginineies- 

15 

50. The method of claim 48 wherein said human growth hormone variant (40) further contains phenylalaninei 76 



51. A method for selecting novel binding polypeptides comprising: 



20 (a) constructing a replicable expression vector comprising 

a transcription regulatory element operabiy linked to 

a gene fusion encoding a fusion protein wherein the gene fusion comprises 

a first gene encoding a polypeptide operable connedted to a linking amino 

addsequence, and 

25 a second gene encoding at least a portion of a 

phage coat protein; 

( b ) mutating the vector at one or more selected posittons within the amino acM Hnking sequence of 
the first gene thereby fbnning a family of related plasmids; 



30 



(c ) transforming suitable host cells with the plasmUs; 



35 



( d ) infecting the transformed host cells with a helper phage 
having a gene encoding the phage coat protein; 



(e) culturing the transformed infected host cells under conditions 

suitable for fomning recombinant phagemid particles containing at least a portton of 
the plasmid and capable of transfomning the host, the conditk)ns adjusted so that no 
more than a minor amount of phagemki partk^les display more than one copy of the 
40 fusion protein on the surface of the particle; 



(f) 



contacting the phagemid particles with a target molecule so that at least a portion of the 
phagemid particles bind to the target molecule; and 



wo 92/09690 



80 



PCr/US91/09133 



(g) contacting the bound phagemid parU'des with a protease capable of hydroiysing the linking a 
amink) acid sequence of at teasta porfibn of the bound phagmki partides, and 

5 (h) isolating the hydrolyzedphagmkJ particles. 

52. The method of daim 51 further comprising infecting suitable host cells with the hydrolyzed phagemid 
parttoles and repeating steps (d) through (h). 
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